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1.0  INTRODUCTION 


Current  generation  automated  systems  have  only  a  limited  capability  for  interacting 
with  their  environment.  This  is  due,  in  large  part,  to  the  fact  they  cannot 
understand  their  environment.  The  lack  of  understanding  is  not  caused  by  a  lack  of 
data.  A  variety  of  sensors  are  available  to  capture  information  in  digital  form.  The 
problem  is  to  understand  this  information. 

The  goal  of  this  research  was  to  develop  new  tools  to  aid  in  the  automated  analysis 
of  two  dimensional  arrays  of  pixels.  Such  arrays  can  be  produced  by  optical, 
infrared,  or  radar  sensors.  Understanding  pixel  arrays  by  examining  individual  pixels 
is  difficult  because  each  pixel  contains  only  a  shred  of  evidence  about  the  entity  to 
which  it  belongs.  Looking  at  all  the  pixels  together  usually  presents  too  complicated 
a  mass  of  information  to  be  easily  understood.  For  these  reasons  our  theoretical 
development  and  experimentation  have  all  been  based  on  the  GBT  method  of 
successively  decomposing  two  dimensional  space.  By  using  GBT  cells  of  varying 
sizes,  we  are  able  to  find  areas  of  the  image  that  contain  enough  pixels  to  represent 
something  significant  but  not  so  many  as  to  defy  comprehension. 

The  tools  developed  during  this  research  are  the  function  f  that  provides 
information  about  shape  and  the  vector  T  that  provides  a  local  quantification  of 
image  texture.  High  values  of  f2  over  a  region  indicate  that  the  region  contains  a 
solid,  well  compacted  object,  e.g,  a  rectangle  or  ellipse.  Low  values  indicate  the 
image  is  fragmented.  In  the  context  of  GBT,  f2  tells  whether  a  region  can  be 
understood  as  a  whole  or  whether  it  must  be  decomposed. 

The  T  vector  allows  the  computation  of  the  texture  of  any  region  as  a  convex  sum  of 
the  textures  of  its  constituent  parts.  Preliminary  results  have  shown  that  what  the 
T  vector  measures  corresponds  closely  to  the  human  perception  of  visual  texture. 

The  result  of  this  research  is  that  two  valuable  tools  for  automated  image 
understanding  have  been  developed  and  proven  valuable  in  limited  experimentation. 
More  work  is  required  to  see  what  further  tools  can  be  developed  and  to  exploit  the 
full  power  of  the  tools  we  have  now. 
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2.0  STATEMENT  OF  THE  PROBLEM 


One  of  the  major  weakness  of  today's  automated  systems  is  their  lack  of  ability  to 
convert  analog  data  into  internally  usable  form.  This  applies  particularly  to  two 
dimensional  images.  There  are  important  military  applications  for  understanding  the 
content  of  imagery  from  a  variety  of  sources:  images  from  camera  carrying 
platforms,  radar  images,  and  infrared  images.  Current  automated  techniques, 
however,  have  only  primitive  capabilities. 

The  capability  for  converting  analog  images  to  digital  form  as  an  array  of  pixels 
exists  and  is  relatively  mature.  Pixel  digitization  is  even  accomplished  in  some 
systems  at  the  time  of  image  capture.  The  question  is:  what  do  all  those  pixels 
mean?  Taken  individually,  each  pixel  can  contribute  only  the  tiniest  understanding 
of  the  overall  image  content.  Yet  the  totality  of  pixels,  taken  together,  usually  are 
too  complicated  for  automated  understanding.  What  is  required  is  a  method  of 
examining  regions  of  various  sizes  and  shapes  so  that  entities  that  are  both 
significant  and  simple  can  be  isolated  from  the  rest  of  the  image  and  analyzed. 

The  research  reported  here  lays  the  foundation  for  performing  this  process.  It  uses 
the  Generalized  Balanced  Ternary  (GBT)  method  of  structuring  space  into  a  cellular 
hierarchy  as  the  basis  for  analyzing  the  behavior  of  picture  functions.  It  shows  how 
to  use  shape  descriptors,  both  classical  and  original,  to  understand  the  nature  and 
complexity  of  the  picture  function  at  cells  throughout  the  GBT  hierarchy.  It  further 
gives  techniques  for  piecing  together  the  rigidly  structured  GBT  cells  to  form 
regions  of  arbitrary  shape. 

Besides  its  innovative  use  of  the  GBT  structure  in  analyzing  imagery,  two  new  tools 
have  been  derived  during  this  research:  a  function  called  V-  which  gives  information 
about  the  shape  of  an  image  within  a  region  and  a  quantification  of  regional  texture 
as  a  vector. 

These  new  tools  and  the  easy  ways  they  interact  within  the  GBT  hierarchy  provide  a 
significant  advance  along  the  path  toward  automated  image  understanding. 


3.0  SCOPE 


The  research  performed  under  this  contract  was  done  from  April  1,  1982  to 
October  1,  1982  by  Or.  Dean  Lucas,  Dr.  Laurie  Gibson,  Mr.  Tom  Bundy,  Mr.  Day 
Elston,  and  Mr.  Terry  Elmore.  Past  research  in  the  mathematics  of  Generalized 
Balanced  Ternary  and  its  application  to  processing  binary  images  was  extended  by 
this  work.  In  addition,  commercial  and  experimental  software  systems  previously 
developed  at  Interactive  Systems  Corporation  were  used  in  the  project.  The 
objectives  of  this  research  were  as  follows: 

1.  Demonstrate  the  capability  of  the  Generalized  Balanced  Ternary  approach  to 
extract  from  pixel  data  the  features  and  regions  of  common  texture  in  an 
aerial  photograph. 

2.  Derive  aggregate  measures  for  the  representation  of  cartographic  information 
to  be  stored  in  a  data  file  used  in  feature  recognition  and  change  detection. 

The  research  proceeded  in  these  steps: 

1.  Convert  existing  software  for  scanning  and  processing  binary  images  to  handle 
the  scanning  of  photographs.  Develop  histograming,  image  enhancement, 
display,  and  hard  copy  factions. 

2.  Define  texture. 

3.  Build  software  for  computing  texture  and  test  on  photographs. 

v  4.  Build  software  for  the  statistical  analysis  of  texture  (compute  parameters  of  a 

distribution  and  Mahalanobis  distance). 

5.  Analyze  the  texture  definition  and  its  ability  to  distinguish  intuitive  textures. 

6.  Define  aggregate  descriptors. 

7.  Modify  existing  software  to  compute  these  descriptor  functions  for  the 
scanned  images. 

8.  Test  the  descriptors  on  actual  images  and  refine  definitions.  Select  those  that 
seem  most  useful. 

During  the  course  of  this  project  the  following  papers  were  presented  which 
announced  preliminary  results  of  the  research. 

1.  "Spatial  Data  Processing  using  Generalized  Balanced  Ternary"  presented  by 
Dean  Lucas  at  the  IEEE  Computer  Society  Conference  on  Pattern  Recognition 
and  Image  Processing,  June  1982,  Las  Vegas,  Nevada. 

2.  "Automated  Data  Capture"  presented  by  Laurie  Gibson  at  the  Symposium  on 
Automation  Technology  for  Management  and  Productivity  Advancement 
through  CAD/CAM  and  Data  Handling,  November,  1982,  Naval  Postgraduate 
School,  Monterey,  California. 

{ 
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4.0  DESCRIPTION  OF  THE  TECHNOLOGY 


4.1  Picture  Functions 


The  goal  of  this  research  was  to  develop  tools  to  enable  a  computer  to  understand 
the  content  of  a  picture.  In  order  to  understand  the  progress  of  the  research  and  the 
nature  of  the  tools  that  were  developed,  it  is  necessary  to  start  at  the  beginning, 
with  the  basic  notion  of  a  picture  function. 

For  our  purposes,  a  picture  function,  p,  is  a  real  valued  function  defined  on  the 
plane.  In  what  follows  we  will  address  the  points  in  the  plane  as  complex  numbers, 
i.e.,  numbers  of  the  form  x  +  yi  where  x  and  y  are  real  numbers  and  i^  =  -1.  In 
practice,  the  values  that  a  picture  function  can  take  on  are  often  restricted. 
Picture  functions  which  take  on  the  values  0  and  1  only  are  called  binary  picture 
functions.  We  will  use  the  term  gray  level  when  speaking  of  a  picture  function  that 
takes  on  integer  values  from  0  to  255  inclusive.  Figure  1  is  an  illustration  of  a 
binary  picture  function.  The  illustration  is  meant  to  be  interpreted  as  follows:  at 
every  point  at  which  ink  is  present  on  the  paper  the  picture  function  takes  on  the 
value  1,  where  ink  is  not  present  it  takes  on  the  value  0. 


Figure  1.  An  Illustration  of  a  Binary  Picture  Function. 


Picture  functions  can  be  added  together,  multiplied  together,  and  multiplied  by 
scalars  to  yield  new  picture  functions.  In  what  follows  we  will  take  advantage  of 
these  algebraic  properties  of  picture  functions  to  try  to  express  complicated  picture 
functions  as  sums  or  differences  of  simple  picture  functions.  For  instance,  the 
picture  function  illustrated  in  Figure  1  might  be  expressed  as  the  sum  of  two  more 
easily  understandable  picture  functions:  one  an  ellipse,  the  other  a  rectangle. 
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4.2  Classical  Techniques 

We  are  interested  in  developing  tools  for  determining  the  content  of  pictures.  Our 
techniques  are  based  on  some  relatively  well  known  analysis  functions.  In  this 
section  we  restate  these  functions  to  remind  the  reader  of  their  existence  and 
introduce  our  notation.  In  what  follows,  p  is  a  picture  function  on  the  complex 
plane,  z  =  x  +  yi  is  a  variable  complex  number,  c  is  a  fixed  complex  number,  and  R  is 
a  region  of  the  plane. 

For  q  =  0,  1,  *  ‘  * ,  and  m  =  0,  1,  '  ’  * ,  the  qth  moment  of  pm  about  c  in  the 
region  R  is: 


M(q,c,pm,R)  =  /  (z-c)qp(z)mdz 

zeR 


Let  us  illustrate  and  name  some  of  the  simpler  of  these  moments.  (The  question  of 
the  existence  of  these  integrals  is  not  addressed  since  it  has  no  practical 


consequences.) 

M(0,c,p°,R) 

=  I 

ZER 

M(0,c,p,R) 

=  f 

zeR 

M(l,c,p,R) 

-  I 

zeR 

M(2,c,p,R) 

=  I 

zeR 

There  are  several  other  classical  functions  that  are  related  to  or  derived  from 
moments.  We  define  and  name  those  functions  that  we  will  need  later. 


u(p,R) 

Var(p,R) 

S(c,p,R) 

c*(p,R) 


=  MCOjC.p^RJ/N/KOjCjP^R)  =  the  mean  of  p  on  R. 


2  0  2 

=  M(0,c,p  ,R)/M(0,c,p  ,R)  -  y(p,R)  =  the  variance  of  p  on  R. 


=  f  |z-c  l2  p(z)dz  =  the  scalar  moment  about  c  of  p  on  R. 
z£R 

* c  *  afefffS  *  ;  2  p(2)d2/ ;  p(2)d2 


zeR 


zsR 


=  the  centroid  of  p  on  R  (note  that  the  centroid  is  independent  of 
the  fixed  point  c). 
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For  many  applications  we  will  wish  to  use  the  centroid  c  (p,R)  as  the  fixed  point  c 
about  which  we  compute  moments.  Below  we  state  several  classical  results 
concerning  moments  about  centroids  of  picture  functions. 

Xj  (p,R)  =  J4(S(c\p,R)  +  |M(2,c\p,R)j)/M(0,c\p,R) 

=  the  primary  eigenvalue  of  p  on  R. 

X2(p,R)  =  tt(S(c\p,R)  -  |M(2,c*,p,R)|)/M(0,c\p,R) 

=  the  secondary  eigenvalue  of  p  on  R. 

* 

M(l,c  ,p,R)  =  0.  In  words,  the  first  moment  about  the  centroid  is  zero. 

Ax(p,R)  =  h( angle  of  M(2,c  ,p,R) 

=  the  angle  of  the  principal  axis  of  inertia  of  p  on  R. 

It  rnay  happen  that  we  have  computed  moments  and  the  scalar  moment  about  a 
point  c  which  is  not  the  centroid  (the  origin  for  instance).  Since  the  second  moment 
and  the  scalar  moment  about  the  centroid  are  needed  for  computing  eigenvalues,  it 
is  handy  to  have  a  means  of  obtaining  them  directly.  The  formulas  below  provide 
such  a  means: 

M(2,c*,p,R)  =  M(2,c,p,R)- M(l,c,p,R)2/M(0,c,p,R) 

S(c*,p,R)  =  S(c,p,R)  -  M(l,c,p,R)  Vi(l,c,p,R)/M(0,c,p,R), 

where  M(1  ,c,p,R)  is  the  complex  conjugate  of  M(l,c,p,R) 

4.3  Examples 

In  order  to  show  the  utility  of  these  classical  analysis  functions  and  to  lead  into  the 
discussion  of  other  functions,  it  is  worth  looking  at  some  simple  examples. 

Let  p  be  the  binary  picture  function  representing  a  rectangle  of  half  height  a  and 
half  width  b  centered  at  the  origin.  Figure  2  illustrates  this  picture  function.  Some 
relevant  statistics  for  this  picture  function  have  been  computed  and  are  shown 
beside  the  illustration.  Figures  3  and  4  show  the  comparable  illustration  and 
statistics  for  an  ellipse  and  a  rhombus  both  centered  at  the  origin  with  semiaxes  a 
and  b. 

I  weight  =  4ab 

_  first  moment  =  0  .  ,  , 

second  moment  =  ?  (b  a  -  a  b) 


in  mvmuiiL  -  v  ^  ^  ^ 

second  moment  =  j  (b  a  -  a  o) 
4  3  3 

scalar  moment  =  j  (b  a  +  a  b) 

.  r  a^/3  if  a  >  b 
1 1  -  l 

bZ/3  if  b  >  a 
=  |  b^/3  if  a  >  b 
a  /3  if  b  >  a 


Figure  2.  A  Binary  Picture  Function  of  a  Rectangle  centered 
at  the  Origin  together  with  significant  Statistics. 
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— H 


weight  =  trab 

first  moment  =  0  ^  ^  ^ 

second  moment  =  ^  (b  a  -  a Jb) 

scalar  moment  =  ^  (b^a  +  a^b) 
X  =  {  if  a  >  b 

b V4  if  b  >  a 

X  =  {  if  3  >  b 

a2/4  if  b  >  a 


Figure  3.  The  Comparable  Picture  Function  and  Statistics  for  an 
Ellipse  centered  at  the  Origin  with  Semiaxes  a  and  b. 


weight  =  2ab 

first  moment  =  0  ^  ^ 

second  moment  =  ^  (b  a 


scalar  moment 


1 

5  (b  a 


_  r  a  /6  if  a  >  b 
]  "  l  ? 

b  /6  if  b  >  a 

b2/6  if  a  >  b 

a2/6  if  b  >  a 


X,  =  { 


•  a\) 
d\) 


Figure  4,  The  Comparable  Picture  Function  and  Statistics  for  a  Rhombus 
centered  at  the  Origin  with  Semiaxes  a  and  b. 


To  get  a  feel  for  the  computation  involved  and  to  provide  one  more  useful  example, 
let  us  explicitly  derive  the  moments  and  scalar  moment  for  a  regular  n-gon  of  outer 
radius  R.  Specifically,  for  each  integer  n  greater  than  two,  we  will  construct  a 
regular  n-gon  centered  at  the  origin  from  n  isosceles  triangles  similar  to  the 
triangle,  T,  shown  in  Figure  5.  The  n-gon  is  built  as  a  union  of  n  of  these  triangles 
each  successive  triangle  being  rotated  through  an  angle  of  2ir/n  from  the  position  of 
its  predecessor.  Thus,  the  set,  S,  of  points  in  the  n-gon  is: 

(1)  S  =  {zej27n/n  |  zeT,  j  =  0,  1,  *  *  \  n-1 } 

where  T  is  the  triangle  shown  in  Figure  5  and  i^  =  -1.  The  picture  function,  p,  which 
describes  this  n-gon  takes  the  value  1  on  all  points  in  S  and  0  elsewhere. 
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(2) 


The  qth  moment  of  pm  about  the  origin  is  computed  as  follows: 

M(q,0,pm,C)  =  J  zq  p<z)mdz  ,  by  definition 

zeC 

=  /  Zq  p(z)dz,  since  p(z)m  =p(z) 

zeC 

s  ;  z<*  dz,  since  p(z)  =  1  within  the  n-gon,  0  elsewhere 

zeS 

s  ni  J  zd  eJ q2iri/n  dz,  from  (1) 
j=0  zeT 

=  nZl  ejq2iri/n  /  zq  dz- 
j  =0  zeT 

tet  us  examine  the  first  portion  of  ft*  expression,  it  is  a  geometric  series 

ln  en2»i/"  ,t  q  is  a  multiple  of  n  then  e"1'"1"  •  1  and  the  senes  sums  to  n.  It  q 

is  not  a  multiple  of  n  then  e',2"',n  *  1  and  the  following  holds. 

q2iri 


njl  ejq2ui/n  s  1 
j=0 


e~*~  ,  0  since  e'?27”  -  1  and  the  denominator  is  not 

,  q2ni/n 
1  -  e 


zero. 


Now  let  us  compute  the  integral  which  is  the  second  part  of  (2) 
RcosCtr/n)  xtan(-rr/n) 


J  zqdz  =  J 
zeT  0 


j  (x  +  yi)^  dy  dx,  since  these  limits 

-xtan(ir/n) 


describe  T 


(1  +  itan(Tt/n))q+I  -  (1  -  itantn/n))^  R<3*2  coSQ+2(ir/n) , 

*  - - - "i  (q  ♦  L)(q“  2) 

by  standard  integration. 

From  these  two  considerations  we  can  describe  all  moments  of  the  n-gon  of  outer 
radius  R  described  above: 

0,  if  q  is  not  a  rruitiple  of  n 

<s>  M(„o,pm,c)  .  (  t  j  u  ;  ‘“"WaU— 

otherwise. 
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In  particular  the  first  and  second  moments  of  the  n-gon  are  always  zero  since  n  must 
be  greater  than  two.  The  scalar  moment  is  computed  similarly: 


S(0,p,C)  =  J  |z|^  p(z)dz,  by  definition 

zeC 

2 

=  J  |  z  |  dz,  since  p(z)  =  1  in  S  and  p(z)  =  0  elsewhere 
zeS 


=  2  /  }ze^ q2Tri j 2  dz,  from  (1) 

j=0  zeT 

=  E  l  |z|2d7,  since  |e^Z7ri^n|  =  1 
j=0  zeT 

Rcos(7r/n)  xtan{ir/n)  2  2 

=  n  /  J  x  t  y  dy  dx 

0  -xtan(ir/n) 

=  ftn(tan(n/n)  +  tan^(Tt/n)/3)R^cos^(iT/n) 

Since  the  first  moment  of  the  n-gon  is  zero  its  centroid  is  at  the  origin  and  the 
moments  we  have  just  computed  are,  in  fact,  moments  about  the  centroid.  By 
examining  the  formulas  for  eigenvalues  given  above,  we  can  see  that  the  effect  of 
the  second  moment  being  zero  is  that  both  eigenvalues  are  equal  and  are  explicitly 
computed  as  follows: 

X1(p,C)  =  X2(p,C)  =  2  M(o|o’p!c) 


=  14(1+  tan^(ir/n)/3)R^cos^(ir/n) ,  by  (3)  and  (4) 
The  results  of  these  computations  are  summarized  in  Table  1. 


2  2  2 

weight  =  n  tan  (ir/n)  R  cos  (n/n)  =  nR  sinCxr/n)  cos(n/n) 
first  moment  =  0 
second  moment  =  0 

scalar  moment  =  Hn(tan(n/n)  +  tan^(n/n)/3)  R^  cos^(it/n) 

Xj  =  X2  =  K  (1  +  tan^(ir/n)/3)  R^  cos^(tf/n) 


Table  1.  Basic  Statistics  for  an  N-gon  of  Radius  R. 


As  a  final  example,  we  will  compute  the  usual  classical  statistics  for  the  binary 
picture  function  illustrated  in  Figure  6.  The  picture  function  takes  the  value  1 
within  each  of  the  n  rectangles  shown  and  is  0  elsewhere.  The  computational  steps 
will  be  somewhat  more  abbreviated  than  in  the  previous  example,  leaving  the  reader 
to  fill  in  the  details. 

4 


B 

jj§ 

m 

• « * 

■ 

0 

h 

in 

i 

■ 

2 

mi 

n-l 

||J§ 

t 


Figure  6.  N  identical  rectangles  aligned  and  equally  spaced.  Rectangles 
are  of  height  2h  and  width  w  (0  £  w  £  1). 


M(q,0,p,C)  =  J  zq  p(z)dz 
zeC 

=  z  IJ:+W  J  [J  (x  +  yi)q  dy  dx 
j  =0  J 


1  V  (V  (q*z)wk((j+hi)q+2-k  -  (j-hi)q+2*k)) 


=  i  (q+1)  (q+2)  \L=[  '  k 

In  particular,  by  substituting  q  =  0,  1,  and  2,  we  get: 

1  n_1 

weight  =  zer  E  4hiw  =  2nhw 
21  j=0 

I  n- 1  2 

first  moment  about  0  =  y-r  E  (I2jhiw  +  6hiw  ) 

61  j  =0 


=  nhw(n  +  w  -  l) 


n-1 


1  11  2  3  2  3 

second  moment  about  0  =  -ryr  E  ( 24 j  hiw  -  8h  iw  +  24jhiw  +  8hiw  ) 

Ul  j=0 
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3n  +  1  -  2h2  +  3rw  -  3w  +  2w2) 


=  ^  (2n2  - 

Computing  the  scalar  moment  about  the  origin  goes  as  follows: 

S(0,p,C)  =  /  |  z  j 2  p(z)dz 

"eC 


fj+w  r  h  2  2.  , 

=  T  J  J  .  x  +  y  dy  dx 

j=0  J 

-  (2n2  -  3n  +  1  +  2h2  +  3nw  -  3w  +  2w2) 

From  the  weight  and  first  moment  we  obtain  the  centroid: 

*v  nhw(n  +  w-l)  n+w-1 

c  (p,c)  =  - 2SFS? -  =  - 2 - 

Then  the  second  moment  and  scalar  moment  about  the  centroid  are  computed  from 
the  formulas  provided  above: 

M(2,c*,p,C)  =  M(2,0,p,C)- M(t,0,p,C)2/M(0,0,p,C) 

nhw  ,2  2  .  2x 

=  -g-  (n  +  w  -  1  -  4h  ) 

S(c*,p,C)  =  S(0,p,C)-  M(l,0,p,C)M  (l,0,p,C)/M(0,0,p,C) 

nhw  /  2  2  .  2\ 

-  -g-  (n  +  w  -  1  +  4h  ) 

In  this  example  Vl(2,c  ,p,C)  is  a  real  number  which  may  be  positive  or  negative 
depending  on  the  size  of  h.  Therefore,  |  M(2,c  ,p,C)  |  =  ±M(2,c  ,p,C).  It  follows 
that  the  eigenvalues  and  X£  are  the  maximum  and  minimum,  respectively,  of 
the  following  two  expressions: 


.4  f  and 


}  = 


max 


{  04(S(c*,p,C)  ±  M(2,c*,p,C))/M(0,c\p,C) 

min 


max  .  -  .. 

~  ‘  1— T5  \n  +  w  -  1  +  4h 

min 

±  (n2  -t-  w2  -  I  -  4h2))/2ntav) 


max  2  2  .  .2 

;  /n+w-Ihv 

=  t  « - n — >  rK 


min 


We  now  wish  to  use  some  of  the  classical  pattern  descriptors  that  we  have  been 
discussing  to  derive  additional  information  about  shape.  The  first  three  examples  in 
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the  previous  section  provide  the  motivation.  These  were:  a  rectangle,  an  ellipse,  and 
a  rhombus,  each  with  semiaxes  of  length  a  and  b.  In  each  case  the  weight  was  a 
multiple  of  the  products  of  the  lengths  of  the  semiaxes  by  a  factor  that  we  will  call 
s.  In  each  case  the  principal  and  secondary  eigenvalues  were,  respectively,  the 
squares  of  the  semimajor  and  semiminor  axes  divided  by  a  factor  that  we  will  call  t: 

weight  -  sab 

Xj  =  max  {a2/t,  b2/t} 

X2  =  min  {a2/t,  b^/t } 

For  these  three  examples  s  and  t  take  on  the  values  given  in  Table  2.  Although  the 
relations  described  above  are  not  true  for  all  picture  functions  (a  and  b  are  not 
always  well  defined),  their  essence  can  be  exploited  in  deriving  a  general  shape 
description.  Notice  that: 

weight2  =  s2a2b2 

and  XjA2  -  a2b2/t2 

thus,  .  sV. 

X ,  A* 


Weight 

IVV 

s 

t 

f 

f2 

Rectangle 

4ab 

{a2/3,b2/3} 

4 

3 

6 

36 

Ellipse 

nab 

{  a2/4,  b2/4  } 

IT 

4 

21T 

4n2 

Rhombus 

2ab 

{  a2/6,  b2/6  } 

2 

6 

6 

36 

Table  2.  Statistics  for  the  Rectangle,  Ellipse,  and  Rhombus. 


The  left-hand  side  of  this  expression  contains  terms  that  are  computable  fcr  any 
picture  function  while  the  right-hand  combines  two  unknowns  which  individually 
contain  information  about  the  shape  of  certain  binary  picture  functions.  With  the 
hope  that  the  left-hand  side  may  be  of  use  as  a  shape  descriptor  for  arbitrary  picture 
functions,  we  make  the  following  definition: 

«p,r)  .  < jaiSssjusiL-)* 

4XI(p,R)X2(p,R) 

and,  therefore,  f(p,R)2  = 

4X1(p,R)X2(p,R) 
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The  four  in  the  denominator  is  inserted  to  make  some  other  expressions  for  f2  less 
cluttered.  For  instance,  by  substituting  the  formulas  for  X^  and  X^  in  terms  of 
moments  into  the  above  definition  for  f 2,  one  obtains: 


f2(p,R>  -  — - ^C--^ - -J- 

S(c  ,P,R)Z  -  Im(2,c  ,p,R)r 

In  what  follows  let  us  consider  the  picture  function,  p,  and  the  region,  R,  to  be 
implicit  and  suppress  them  from  the  notation.  Thus,  the  above  expression  becomes: 


f2  = 


M(o,cy 

S(c*)2  -  |m(2,c*)!2 


To  express  the  f z  function  in  terms  of  moments  about  an  arbitrary  point,  c,  we  make 
the  following  observations: 

M(0,c*)^  =  M(0,c)^  for  any  c. 

c/„*\2  M(l,c)  M(l,c) *2 

S  c  >  =  <s(c) - 

_  sCc)2  _  2S(c)  M(l,c)  M(l,c)  +  M(l,c)2  M(l.c)2 

M(0,c)  M(0,c)2 

lM(2,c  )|2  =  (M(2,c)  -  ^j^j-)(M(2,c)  - 

=  M(2,c)M(2,c)  -  M(2,c)~— -  M(2,c)1— 


M(0,c) 


M(0,c) 


M(0,c)i 


therefore 

S(c*)2  -  |m(2,c*)  | 2  =  S(c)2  -  |M(2,c)  | 2  +  ^g~y(M(2,c)M(l,c)2  + 

ffl(2,c)MU,c)2  -  2S(c)|M(l,c)|2) 
and 


S(c)2  -  lM(2,c) i 2  +  (M(2,c)M(l,c)2  +  Si(2,c)M(2,c)2  -  2S{c)1m(1,c)|2)/M(0,c) 

Table  2  contains  values  of  f  and  f  for  the  rectangle,  ellipse,  and  rhombus.  Notice 
that  the  values  are  independent  of  the  lengths  of  the  semiaxes,  a  and  b,  and  are 
identical  for  the  rectangle  and  rhombus.  This  is  no  accident  as  we  shall  see  when  we 
discuss  invariants.  Before  going  on  let  us  see  what  can  be  learned  by  applying  the  f^ 
function  to  our  other  two  examples:  the  regular  n-gon  and  the  aligned  rectangles.  For 
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2 

the  n-gon,  we  use  the  definition  of  f 4  in  terms  of  moments  about  the  centroid  and 
take  advantage  of  the  fact  that  the  second  moment  about  the  centroid  is  zero: 

t2 _ A  iCO.c*) 4 

S(cV  -  |M( 2 , c  )|2 

_ (n  tan(fl/n)  R2  cos2(Tr/n))** _ 

(>4n(tan(iT/n)  +  tan  ( tt/n ) /3)  R  cos  (n/2)) 

4n2  tan2(rc/n) 

(1  +  tan2(n/n)/3)2 
«  2n  tan(ir/n) 

so,  i  =  - 5 - 

1  +  tan^(ir/n)/3 

Table  3  gives  the  values  of  f  and  f^  for  the  regular  n-gon  as  n  goes  from  3  to  30. 
Some  aspects  of  the  table  are  worth  examining.  For  an  equilateral  triangle  (3-gon),  f  ^ 
is  exactly  27.  For  a  square,  f 2  is  36.  This  latter  result  was  amply  clear  from  Table  2 
since  a  square  is  a  special  case  of  both  a  rectangle  and  a  rhombus.  For  a  regular 
hexagon,  f^  is  38.88  (223^/5^  exactly).  This  result  will  be  of  interest  later.  As  for 
the  rest  of  the  table,  it  appears  that  the  values  of  f  are  increasing  and  converging.  By 
examining  the  derivative  of  f  and  the  limit  of  f  as  n  goes  to  infinity  one  can  prove 
that  f  is  an  increasing  function  of  n  that  converges  to  2n.  That  f  converges 
to  2tt  for  an  n-gon  was  to  be  expected  from  Table  2  since  f  on  an  ellipse,  hence  a 
circle,  was  2it. 
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Table  3.  Values  of  f  and  f2  for  the  regular  n-gon  as  n  goes  from  3  to  30. 

Let  us  see  how  f2  behaves  on  our  last  example  from  the  previous  section,  ^he 
alignment  of  n  identical  rectangles  shown  in  Figure  6.  Using  the  formula  for  in 
terms  of  moments  about  the  centroid, 


_  36(2nhv) _  _ 

(nhw)^((n^  +  w^  -  1  +  4h^)^  -  (n^  +  w^  -  1  -  4h^)^) 
36  x  16(nlw)^ 

- - x—x - *- 

4(4nV  +  4wV  -  4n  ) 

,,2  2 
36n  w 

i — 2 — : 

n  +  w  +1. 
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Recall  that  n  is  a  positive  integer  giving  the  number  of  rectangles  and  w  is  a  number 
between  0  and  1  giving  the  width  of  each  rectangle.  When  n  or  w  is  1,  takes  the 
value  36.  For  the  former  case  there  is  just  one  rectangle  and  in  the  latter  the  n 
rectangles  are  merged  into  one  long  rectangle.  Therefore,  the  value  36  is  consistent 
with  our  earlier  observations  on  rectangles. 

f  decreases  as  w  decreases,  going  to  zero  as  w  goes  to  zero.  This  property  gives 
some  insight  into  the  behavior  of  f2:  low  values  of  f2  on  binary  picture  functions 
indicate  relatively  small  dark  areas  separated  by  relatively  large  spaces,  f2  decreases 
as  n  increases,  converging  to  36w2  as  n  goes  to  infinity.  This  means  that  whenever  n 
is  large,  f/6  is  a  good  estimator  for  the  proportion  of  darkened  area  along  a  line  of 
identical  rectangles. 

This  example  is  of  more  than  academic  interest  since  lines  of  identical  (or  almost 
identical)  rectangles  occur  frequently  in  applications.  When  h  is  small  compared  with 
w,  a  line  of  rectangles  looks  like  a  dashed  line.  When  h  is  very  large  compared  with 
w,  a  line  of  rectangles  looks  like  a  family  of  parallel  lines.  The  value  of  f  ,  however, 
is  independent  of  h.  This  is  no  accident  as  will  be  evident  below. 


Table  4  lists  the  formulas  we  have  derived  for  f^  based  on  our  examples  together  with 
some  other  formulas  that  the  reader  is  encouraged  to  derive  independently. 


Shape  Described  by  Binary  Picture  Function 

f2 

Rectangle 

36 

Ellipse 

4u2  (=39.4784) 

Rhombus 

36 

Regular  n-gon 

^  2ntan(u/n)  ^2 

1  +  tan2( ^/n)/3 

Line  of  n  rectangles  of  width 
w  0  <  w  <  1) 

,,2  2 

36n  w 

~i — 2 — : 

n  +  w  -  1 

Semicircle 

4 

—Jl -  (=  35.3124) 

9tT  -  64 

Annulus  of  inner  and  outer 
radii  r  and  R 

4 IT2  (R;  ■~--r-g)2 

R  +  r 

Table  4.  Formulas  for  f2  for  Various  Types  of  Picture  Functions. 


4.5  Properties  of  f  and  f2 

Throughout  this  section  we  will  assume  that  the  region,  R,  within  which  we  are 
examining  a  picture  function,  is  the  entire  complex  plane,  C.  This  allows  a  slight 
notational  simplification.  For  instance,  we  will  write  M(q,c,p,C)  as  M(q,c,p). 

We  are  concerned  in  this  section  with  what  happens  to  our  various  measures  of  picture 
properties,  particularly  f  and  f^,  when  the  picture  undergoes  certain  simple  changes. 
The  first  change  we  will  deal  with  is  translation,  rotation,  and  scaling  of  a  picture. 
Specifically,  if  we  have  an  existing  picture  function,  p,  we  will  construct  a  new 
picture  function,  p  ,  which  takes  on  the  same  value  at  the  point  v(z  +  w)  as  p  takes  on 
at  the  point  z: 

p*(v(z  +  w))  =  p(z). 

Here  v  and  w  are  fixed  complex  numbers.  Adding  w  to  z  has  the  effect  of  translating 
the  picture  by  the  vector  w.  Multiplying  z  +  w  by  v  rotates  z  +  w  about  the  origin  by 
the  angle  of  v  and  scales  z  +  w  in  magnitude  by  the  factor  |  v  |.  These  operations 
are  illustrated  in  Figure  7. 


Figure  7.  How  Translation,  Rotation,  and  Scaling  are  accomplished  by 
Addition  and  Multiplication  of  Complex  Numbers. 


Let  z 


* 


=  v(z  +  w).  Then  the  following  relations  hold: 


p*(z*)  =  p*(v(z  +  w))  =  p(z),  by  definition 
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(z*  -  c)^  =  (vz  +  vw  -  vc/v)q  =  v^(z  -  (c/v  -  w))q 

dz*  =  |  v  1 2dz,  the  Jacobian  of  the  scaling  transformation  is  |  v  | 2 

so,  M(q,c,p*)  =  /  (z*  -  c)qp*(z*)d(z*) 

* 

z  0C 

=  vq  1 v 1 2  /  (z  -  (c/v  -  w))qp(z)dz 

zee 

(6)  =  vq  | v | 2  M(q,  c/v  -  w,  p) 

or,  alternatively,  =  vq  |v|2  /  $  (?)  zk(w  -  c/v)q-k p(z)dz 

zC  k=0  K 


(7)  =  vq  ! v  | 2  ?  (q)  (w  -  c/v)q"k  M(k,0,p) 

k=0  K 

Similarly,  S(c,p*)  =  J"  |z*  -  c|2  p*(z*)dz* 

* 

z  eC 

(8)  =  | v | ^  S(c/v  -  w,  p) 


Letting  q  be  0  and  1  in  (7)  we  find: 

M(0,c,p*)  =  |v|2  M(0,0,p) 

M(l,c,p*)  =  v  | v | 2  ((w  -  c/v)M(0,0,p)  +  M(  1 ,0,p) ) 


Therefore,  c*(p*)  =  c  +  ».B  ,J.  t  the  definition  of  the  centroid  of  p*  on  C 

M(0,c,p  ) 


=  c 


♦  v(w  -  c/v  *  MTgffipT1 


=  v(c  (p)  +  w) 


so  that  c*(p)  =  — ^ —  -  w 

This  last  relation,  when  plugged  into  (6)  and  (8),  makes  the  comparison  of  moments 
about  centroids  easy: 

M(q,c*(p*),p*)  =  vq  |v|2  M(q,c*(p),p) 

S(c*(p*)»P*>  *  I  v  | 4  S(c*(p),p) 
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The  relations  among  eigenvalues  and  f4  for  p  and  p  then  follow  immediately: 


Vp*> 

similarly,  X2(p*) 
and  f2(p*) 


ft(S(c*(p*),p*)  +  |M(2,c*(p*),p*) | )/M(0,c*(p*),p*) 

I v | 2  X^p) 

I v ! 2  x2(p) 

M(0,c,p*)2 

4Xj(p*)X2(p  ) 


=  f2(p) 

This  last  result  is  important  in  understanding  the  utility  of  the  f  function:  the  value 
of  f  does  not  change  when  a  picture  is  translated,  rotated,  and  scaled. 

The  f  function  has  one  other  useful  property  that  we  will  now  proceed  to  demonstrate: 
the  value  of  f  does  not  change  when  a  picture  is  stretched  along  any  single  axis.  This 
is  different  from  the  scaling  operation  treated  above  since  multiplication  by  a 
complex  number  scales  in  two  dimensions.  Here  we  are  referring  to  scaling  in  one 
arbitrary  dimension  while  leaving  the  orthogonal  dimension  unaltered.  This  type  of 
operation,  for  instance,  will  convert  a  circle  into  an  ellipse. 

We  have  already  seen  that  f(p)  is  unchanged  by  translation  and  rotation  of  the  picture 
function  p.  Therefore,  we  can  assume  that  the  picture  function  that  we  will  be 
stretching  has  its  centroid  at  the  origin  and  is  to  be  stretched  in  the  direction  of  the 
X  axis.  No  generality  is  lost  since  any  picture  function  can  be  translated  until  its 
centroid  is  at  the  origin  and  rotated  until  the  axis  to  be  stretched  lies  along  the  X 
axis. 


With  these  caveats  we  construct  from  our  picture  function,  p,  a  new  picture  function, 
p  ,  which  takes  on  the  same  value  at  the  point  sx  +  yi  as  p  takes  on  at  x  +  yi.  Thus 

p*(sx  +•  yi)  =  p(x  +  yi) 


Here  s  is  a  real  number  corresponding  to  the  amount  of  stretch  along  the  X  axis.  The 
effect  of  this  operation  is  illustrated  in  Figure  8.  Since  we  will  be  dealing  for  the 
first  time  with  integrals  in  the  real  variables  x  and  y  it  is  convenient  to  introduce 
some  new  notation: 


OO  QO 

I(m,n)  =  J"  J  )TYnp(x  +  yi)dxdy. 
-00-00 


Letting  z*  =  sx  +  yi,  the  following  holds: 

p*(z*)  =  p*(sx  +  yi)  =  p(x  +  yi)  =  p(z) 
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so, 


dz  =  sdxdy,  s  is  the  Jacobian  of  the  stretching  transformation 

M(q,0,p  )  =  /  z  q  p  (z  )  dz 
* 

z  eC 

=  s  £  (q)  skiq'kJ7  xkyq-kp(x+yi  )dxdy 

k =0  K 

(9)  =  E  (J)  sk+liq‘kl(k,q-k) 

k=0  K 

Letting  q  be  0  and  I  we  find: 

M(0,0,p*)  =  sl(0,0) 

=  sM(0,0,p) 

M(l,0,p*)  =  sil(0,l)  +  s2I(l,0) 
so  that  c  V)  =  ,  sll IM  +  HOti)  i 

M(0,0,p  )  1(0,0)  1(0,0) 


Figure  8.  The  Effect  of  Stretching  a  Picture  Function  by  a  Factor  s  in  the  X  Direction. 
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By  looking  at  our  original  definition  of  the  centroid,  c*(p),  it  should  be  clear  that 
I(1,0)/I(0,0‘  and  il(0,l)/l(0,0)  are  the  real  and  imaginary  parts,  respectively,  of  c  (p). 
Thus  the  above  expression  states  that  the  centroid  of  the  stretched  picture  function, 
c  (p  ),  is  obtained  from  the  original  centroid  by  multiplying  its  real  part  by  the 
stretch  factor  s.  Since  we  have  assumed  that  c  (p)  =  0,  it  follows  that  c  (p  )  =  0  as 
well.  This  simplifies  the  computation  of  higher  moments: 


M<2,c*(p*),p*)  =  M(2,0,p*) 

=  -sl(0, 2)  +  2is2l  (1,1)  +  s3I(2,Q) 

S(c*(p*),p*)  =  S(0,p*) 

=  s//s2x2  +  y2dxdy 
=  s3I(2,0)  +  sl(0,2) 

so,  S(c*(p*),p*)2  =  s6I(2,0)2  +  2s4I(2,0)I(0,2)  +  s2I(0,2)2 

and  |  M(2,c*(p*),p*)  |  2  =  s3I(2,0}2  -  2s4I(2,0)I(0,2)  ♦  s2I(0,2)2  +  4s4I(1,1)2 

Therefore,  S(c*(p*),p*)2  -  |  M(2,c*(p*),p*)  |  2  =  4s4(I(2,0)t(0,2)  -  1(1,  l)2) 


by  (9) 


Because  the  picture  function  p*  is  identical  with  p  when  s  is  set  equal  to  1,  it  follows 
that: 


S(c*(p),p)2  -  I  M(2,c*(p),p)  |  2  =  4(1(2, 0)1(0, 2)  -  1(1,  l,)2) 


We  use  these  relations  with  the  formula  for  f 2 


to  obtain  the  desired  result: 


f2(  *)  . _ M(0,c*(p*),pV _ 

P  '  S(c*(p*) ,p*)2-  l^/fpW 

_ st*M(0,c*(p),p)t> _ 

s4(S(c*(p) ,p)  -  |M(2,c*(p) ,p) |2) 

=  f2(p) 


The  fact  that  a  picture  may  be  stretched  along  any  axis  without  modifying  the  value 
of  f  and  f2  explains  some  of  our  earlier  findings.  A  rectangle  and  rhombus  have  the 
same  value,  36,  for  f  2  since  they  can  both  be  formed  by  stretching  a  square.  It  is  easy 
to  see  that  any  parallelogram  can  be  formed  from  a  square  in  this  manner  so  all 
parallelograms  will  have  f2  =  36.  Similarly,  any  triangle  can  be  formed  from  an 
equilateral  triangle  (a  regular  3-gon)  by  two  stretching  operations  and  appropriate 
translation,  rotation,  and  scaling.  Since  we  have  seen  that  f 2  for  a  binary  equilateral 
triangle  is  27,  that  value  is  the  same  for  any  triangle  represented  as  a  binary  picture 
function.  Applying  the  above  result  to  our  example  of  the  aligned,  equally  spaced 
rectangles,  we  can  see  why  the  value  of  f2  is  independent  of  the  rectangle's  height: 
changing  height  is  just  a  stretch  operation. 
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2 

The  invariance  of  f  under  stretching  of  the  picture  is  important  for  another  reason. 
Shapes  in  the  plane  appear  as  stretched  when  they  are  viewed  from  a  position  other 
than  the  perpendicular.  A  circle  on  the  ground,  for  instance,  is  seen  as  an  ellipse 
from  an  aircraft  which  is  not  directly  overhead.  Thus,  for  planar  shapes,  the 
invariance  of  f^  under  stretching  is  equivalent  to  the  insensitivity  of  its  shape 
recognition  properties  to  the  location  of  the  viewer. 

We  close  this  section  by  reporting  one  last  property  of  f  and  f^.  Suppose  the  picture 
function  p  is  multiplied  by  the  real  number  a  to  obtain  a  new  picture  function  ap 
where  ap(x)  =  a(p(x)),  then  f(ap)  =  I  a  I  f(p).  The  reader  is  invited  to  try  his  hand  at 
demonstrating  this  relatively  straightforward  fact. 

4.6  Generalized  Balanced  Ternary  (G^T) 

Until  this  point,  we  have  dealt  with  picture  functions  that  were  defined  at  every  point 
in  the  plane.  Now  we  will  deal  with  picture  functions  that  are  defined  only  on  a 
discrete  set  of  points  in  the  plane.  The  discrete  set  we  will  be  working  with  is  the  set 
of  points  that  are  addressable  via  the  2-dimensional  Generalized  Balanced  Ternary 
(GBT)  system.  GBT  is  described  at  some  length  in  the  appendix  to  this  report.  The 
salient  facts  upon  which  the  present  development  depends  are  stated  below. 

GBT  is  a  spatial  addressing  system  which  supports  the  addressing  of  a  fixed  hierarchy 
of  cells.  The  highest  addressable  cells  are  regular  hexagons  which  we  sometimes  call 
zeroth  level  aggregates.  The  next  higher  cells  all  consist  of  seven  hexagons  grouped 
as  one  central  hexagon  and  its  six  neighbors.  These  are  called  first  level  aggregates. 
In  general,  an  nth  level  aggregate  is  composed  of  a  central  (n-l)st  level  aggregate  and 
its  six  neighbors.  Thus,  an  nth  level  aggregate  contains  7n  hexagons.  Figure  9  shows 
a  third  level  aggregate  and  some  of  tl.n  addressing  notation. 

Since  the  distance  between  adjacent  hexagon  centers  is  l,  the  inner  radius  must  be 
ft.  If  follows  that  the  outer  radius  of  each  hexagon  must  be  1/^3  as  shown  in 
Figure  10. 

A  point  is  said  to  be  addressable  in  GBT  if  it  is  at  the  center  of  any  of  the  hexagons 
that  make  up  the  GBT  grid.  A  correspondence  between  points  addressable  in  GBT  and 
a  discrete  subset  of  the  complex  numbers  can  be  set  up  in  the  following  way.  The 
complex  numbers  0  and  1  mapped  to  the  centers  of  the  hexagons  whose  GBT  addresses 
are  0  and  1  respectively.  The  complex  number  i  is  mapped  so  that  the  vector  from  0 
to  i  is  orthogonal  to  the  vector  from  0  to  1  in  clockwise  rotation.  The  point  onto 
which  i  is  mapped  is  not  addressable  in  GBT.  All  other  complex  numbers  are  mapped 
linearly.  The  resulting  correspondence  between  GBT  addressable  points  within  a  first 
level  aggregate  and  complex  numbers  is  shown  in  Table  5. 


Complex  Numbers 

GBT  Addresses 

0 

0 

1 

1 

r  /3i) 

3 

Vj(-1  +  /3i) 

2 

-1 

6 

*(-l  -  /3i) 

4 

>4(1  -  *^3i ) 

5 

. 

Table  5.  The  Correspondence  between  Addresses  in  the  Central 
First  Level  Aggregate  and  Complex  Numbers. 


It  turns  out  that  the  set  of  complex  numbers  which  correspond  to  GBT  addressable 
points  under  this  mapping  is  closed  under  addition  and  multiplication.  This  means  that 
GBT  addressable  points  are  a  discrete  subring  of  the  complex  numbers.  The 
importance  of  this  algebraic  property  derives  from  the  fact  that  the  various  shape 
measures  that  we  have  described  earlier  depended  upon  the  multiplication  and 
addition  (plus  integration)  of  complex  numbers.  Thus,  we  have  reasonable  hopes  for 
applying  the  same  or  similar  formulas  to  GBT  based  discrete  picture  functions. 

There  are  two  other  reasons  for  considering  GBT  based  picture  functions.  One  has  to 
do  with  the  technology  for  converting  pictures  to  digital  form.  This  is  usually  done  by 
sampling  the  picture  over  3  small  region  with  some  sort  of  scanning  device  and 
returning  a  single  value  that  corresponds  to  an  average  of  the  picture  function  over 
that  region.  This  value  is  generally  viewed  as  being  assigned  to  the  centerpoint  of  the 
sample  region.  These  values  together  with  the  points  to  which  they  are  assigned  are 
often  called  pixels  (short  for  picture  elements).  Converting  a  picture  function  into 
pixels  by  a  scanning  process  effectively  discretizes  the  picture  fuction.  Thus,  GBT,  as 
a  discrete  system,  provides  a  good  framework  for  handling  digitized  images. 

The  last  reason  for  considering  picture  functions  based  on  GBT  is  the  natural 
aggregation  mechanism  GBT  provides.  Our  eventual  goal  is  automated  understanding 
of  the  content  of  images.  A  pixel,  in  itself,  provides  almost  no  information  on  the 
content  of  the  image  of  which  it  is  a  part.  Except  for  very  simple  cases,  however,  all 
of  the  pixels  taken  together  provide  too  overwhelming  a  mass  of  complicated 
information  to  be  easily  comprehended  by  a  computer.  The  GBT  aggregate  structure 
provides  a  sequence  of  intermediaries  between  these  two  extremes.  The  idea  is  that 
there  must  exist  aggregates  for  a  given  location  in  a  given  picture  that  contain 
enough  pixels  so  that  significant  entities  can  be  discerned  but  not  so  many  that  the 
complexity  is  overwhelming.  This  theme  of  using  aggregates  to  carve  a  picture  up 
into  understandable  portions  will  be  expanded  upon  considerably  in  the  sections  that 
follow. 


4.7  G  and  H  Functions 


In  this  section  we  wish  to  define  functions  that  are  the  discrete  analogues  of  the 
complex  moment  and  scalar  moment  functions  that  we  used  in  the  earlier  sections.  In 
what  follows  v  is  a  variable  GBT  addressable  point,  c  is  a  fixed  complex  number,  p  is  a 
picture  function  defined  on  GBT  addressable  points,  R  is  a  region  in  the  plane,  and  q 
and  m  are  nonnegative  integers.  By  analogy  with  the  functions  M  and  S  of  Section 
4.2,  we  define: 


and 


G(q,c,pm,R)  =  E 

(v-c)qp( v)m 

v€R 

H(c,p,R)  =  E 

1 v-c | 2p(v) . 

v£R 

In  order  to  use  these  functions  to  derive  some  of  the  useful  shape  descriptions  we 
have  been  using  in  previous  sections  (e.g.,  eigenvalues  and  f^),  we  must  find  their 
explicit  relationship  with  M  and  S.  The  catch  is  that  we  do  not  know  the  values  of  p 
except  at  the  GBT  addressable  points.  We  overcome  this  difficulty  as  follows.  First, 
we  define  a  new  picture  function  p  (z)  at  each  point  z?C  by  p  (z)  =  p(v)  where  v  is 
the  closest  GBT  addressable  point  to  z.  This  means  that  p  (z)  has  the  same  value  as 
p(v)  throughout  the  interior  of  the  hexagon  that  contains  v.  That  p*(z)  is  ill  defined 
on  hexagon  boundaries  does  not  matter  in  what  follows  since  the  hexagon  boundaries 
are  a  set  of  measure  zero.  Second,  we  insist  the  region  R  be  composed  of  whole 
hexagons.  Thus  if  v  is  in  or  out  of  R  then  all  points  interior  to  the  hexagon  which 
contains  v  are  in  or  out  of  R,  respectively.  Letting  h(v)  be  the  hexagon  that  contains 
v,  it  follows  that: 


M(q,c,p*m,R) 


=  ; 

(z-c)qp*(z)mdz 

2«R 

=  E 

/  (z-c)qp  (z)mdz 

veR 

zeh(v) 

=  Z 

p(v)m  /  ((z-v) 

+  (v-c))qdz 

v€R 

zeh(v) 

=  Z 

p(v)m  E  (£)(v-c)k 

/  (z-v)q_kdz 

veR 

k=0  k 

zeh(v) 

Now  the  integral  in  this  last  expression  is  just  the  (q-k)th  moment  about  its  center  of 
a  regular  6-gon  of  outer  radius  1  A'r3.  Its  values  are  given  as  formula  (3)  of  Section 
4.3.  Using  this  formula  for  q  less  than  6  we  see  that: 


/  (z-v)q_kdz  =  {  ^/2  ’ **en  k  =  q 
zeh(v)  0  otherwise 

Therefore,  when  q  is  less  than  6,  the  qth  moment  simplifies  as  follows: 


M(q,c,p*m,R)  =  E  p(v)m(v-c)q  >/3/2 
v£R 
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=  /3/2  G(q,c,pm,R). 

As  moments  higher  than  2  are  not  used  in  deriving  our  current  shape  descriptors,  this 
result  is  all  we  need.  The  relationship  of  the  S  function  to  G  and  H  functions  is 
derived  as  follows: 

S(c,p*,R)  =  /  |z-c |2p*(z)dz 
zeR 

=  E  p( v )  J  j(z-v)  +  (v-c))2dz 

veR  zeh(v) 

=  £  p(v)  /  | z-v ( 2  +  | v-c ( 2  +  (z-v)(v-c) 

veR  zeh(v) 

+  (z-v)(*"-c)dz 

=  £  p(v)  (/  |z-vj2dz  +  |v-c|2/dz 
veR 

+  (v-c)/(z-c)dz  +  (v-c)J(z-c)dz) . 

The  latter  two  terms  in  this  expression  vanish  since  the  first  moment  of  a  hexagon 
about  its  center  (and  its  complex  conjugate)  are  zero.  The  integral  in  the  term 
with  ]  v-c  |  i  is  /3/2  as  we  have  seen  above.  The  first  integral  can  be  evaluated  by 
formula  (4)  of  Section  4.3: 


/  |z-v|2dz 
zeh(v) 


Therefore,  S(c,p  ,R) 


5 

24/3 

Z  p(v)(—  ♦  I  v-cj  2  ?2) 
veR  24/5  £ 


/T  5 

=  -i  H(c,p,R)  +  — —  G(0,c,p,R) 
£  24/3 


=  ^(H(c,p,R)  +  G(0,c,p,R)) 

We  use  the  above  expressions  for  M  and  S  in  terms  of  G  and  H  to  find  the  new 
formulas  for  our  family  of  shape  descriptors. 

CV  r)  -  c  +  9jUc^i5i  - 

C  ,K'  '  C  +  G(0,c,p,R)  G(0,0,p,R) 

X  (p*,R)  =  M  +  *  +  ) 

G(0,c  ,p,R)  J  G(0,c  ,p,R) 
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A  (p*,R)  =  M  .  H(c  jtp,R)  +  5  _  lG,(.?tc  tfitR)]  ) 

G(0,c  ,p,R)  G(0,c  ,p,R) 

f 2(d*  r\  _  3  G( O.c  , p ,R) 
t  (P  ,R)  -  Tg  Xjtp.RJAjtp.R) 

|  G(0,c*,p,R)4 


(H(c  ,p,R)  +  G(0,c  ,p,R))  -  |G(2,c  ,p,R)| 


4.8  Agglomeration 


Suppose  we  want  to  compute  the  values  for  G  and  H  functions  for  a  picture  function, 
p,  about  a  point,  c,  in  a  region,  R.  Suppose  further  that  R  is  a  disjoint  union  of  t 
regions,  R.  (j  =  1,  **',  t),  and  we  know  the  values  of  G  and  H  functions  for  p 
about  a  point  Cj  in  each  region  Rj.  We  would  like  to  use  the  values  of  the  functions  in 
the  subordinate  regions,  Rj,  to  compute  their  values  on  R.  The  method  for 
accomplishing  this  is  derived  below: 


t 

G(q,c,pm,R)  =  E  E  (v  -  c.  +  c.  -  c)q  p(v)m 

j=l  v£R.  J  ] 

=  i  2  l  (q)(v-c.)k(c..c)q'kp(v)m 

j=l  vGR.  k=0  K  ]  ] 

-  1  (q)  Z  (c  -c^-^fk.c  ,pm,R  ) 
k=0  K  j=l  ]  J  1 

t  , 

H(c,p,R)  -  Z  E  |v  -  c.  +  c.  -  cr  p(v) 

j=l  v£R.  >  > 

=  E  Z  ( | v-c . | 2  +  | c . -c | 2  +  (v-c.)(c.-c) 

j=l  VER.  >  I  1  J 

+  (v-Cj )(Cj-c))p(v) 

t  , 

=  E  (H(C  p,R  )  +  |c  -cr  G(0,c.,p,R  ) 
j-I  ]  1  1  >  ] 


.  (c)-c)5(l,c.,p,Rj)  *(c,-c)  Gd.Cj.p.Rj)) 

The  utility  of  the  above  formulas  is  that  they  provide  a  means  of  computing  our 
family  of  shape  descriptors  for  a  region  without  having  to  deal  with  the  potentially 
large  volume  of  pixels  within  that  region.  All  that  is  required  is  the  knowledge  of  the 
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values  of  appropriate  G  and  H  functions  on  a  set  of  regions  Rj  that  partition  R.  This 
feature  is  particularly  well  suited  to  computations  within  the  GBT  aggregate 
hierarchy. 

Suppose  our  goal  is  to  compute  useful  moments  about  the  centers  of  GBT 
aggregates.  Inductively,  we  can  assume  that  we  have  computed  all  such  moments  for 
aggregates  at  level  n-1  and  below.  To  compute  the  moments  at  level  n,  we  have  only 
to  use  the  formulas  derived  above  based  on  the  partition  of  each  level  n  aggregate 
into  its  seven  level  n-1  subordinates.  The  computations  are  further  simplified  since 
the  terms  CyC  are  just  the  offsets  of  the  centers  of  the  subordinates  from  the  center 
of  the  parent  aggregate  and  are  therefore  known  a  priori.  In  the  implementation 
work,  this  was  the  principle  used  for  computing  moments  of  higher  cells.  It  is  far 
more  efficient  than  computing  all  moments  based  on  pixels. 


4.9  More  Examples 

To  illustrate  the  principles  discussed  above,  let  us  compute  our  family  of  shape 
descriptors  for  a  picture  function,  p,  that  assigns  the  value  1  to  all  GBT  addressable 
points  within  an  nth  level  aggregate,  A(n),  and  0  to  all  other  GBT  addressable  points. 
Accordingly  let  A. ,  j  =  0,  '  *,  6,  be  the  7  immediate  subordinates  of  A(n),  let 

c(n)  be  the  center  bf  A(n),  and  Cj  the  center  of  Aj.  Then, 

G(q,c(n),p, A(n))  =  Z  (?)  I  (c.-c(n))q'kG(k,c.  ,p,A. ) 

k=0  j=0  J  ]  ] 

-  Z  (?)G(k,c(n-l) ,p,A(n-l)>  £  (c-  -  c(n))q~k. 

k=0  K  j=0  ' 

The  latter  expression  follows  from  the  observation  that  all  G(k,C:,p,A:)  are  identical. 
But,  for  q-k  less  than  6: 


6 

£ 

j=° 


(c. 


c(n))q~k  =  { 


7  if  q-k  =  0 
0  otherwise 


This  fact  is  established  by  an  argument  similar  to  that  used  for  n-gons  in  Section  4.3. 
Thus,  for  q  less  than  6, 

G(q,c(n),p,A(n))  =  7G(q,c(n-l),p,A(n-l» 

=  7nG(q,c(0),p,A(0)) 

But,  G(q,c(0),p,A(0))  =  £  (v-c(0))qp(v) 

veA(O) 

=  (c(0)  -  c(0))q 

_  r  1  if  q  =  0 
1  0  otherwise 
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Therefore, 


G(q,c(n),p,A(n))  =  ^  q  if  0%  q  <  6. 

In  particular  we  have, 

M(0,c(n),p,A(n))  =  ^  7n 
M(l,c(n),p,A(n))  =  0 
c#(p,A(n))  =  c(n) 

and  M(2,c*,p,A(n))  =  M(2,c(n),p,A(n))  =  0. 

We  apply  similar  logic  for  the  H  function: 

6  - 

H(c(n),p,A(n))  =  £  (H{c.,p,A.)  +  |c. -c(n)  |  G(0,c.  ,p,A. ) 

j=0  1  ‘  ]  '  ’ 

+  (c.-c)  G(  1  ,c j  ,p,Aj )  +  (c.-c)  G(l,c. ,p,A. )) 

The  last  two  terms  vanish  since  gradients  of  full  aggregates  are  zero. 
Each  |Cj-c(n)  |  is  the  square  of  the  distance  from  the  center  of  a  level  n  aggregate 
to  the  center  of  its  jth  level  n-1  subordinate.  For  the  central  subordinate  this 
distance  is  zero.  For  the  six  others,  each  |c.-c(n)|2  =  7n_1.  We  have  seen  above 
that  G(0,c(n-l),p,A(n-l))  =  7n_1.  Therefore, 

H(c(n),p,A(n))  =  7H(c(n-l),p,A(n-l))  +  6  *  72n“2 

This  is  a  classical  recursion  of  the  form  r(n)  =  ar(n-l)  +  bcn  whose  solution  is 

n  n+l 

r(n)  =  anr(0)  +  b  -  — — - . 

a*C 

2  2 

Applying  this  solution  with  a  =  7,  b  =  6  x  7  ,  and  c  =  7  ,  we  find 
H(c(n),p,A(n))  =  7*11(0(0)  ,p,A(0))  +  7n_1(7n-l) 


=  7n'1(7n-l) 


since  H(c(0),p,A(0))  =  0. 


Applying  the  formulas  for  eigenvalues  and  f  from  Section  4.7  we  find: 

yP,A<n»  =  yP,A<n»  .  »  *  JS> 


36  x  7n  -  1 
2  x  7  x  6i 
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f  -  Wt  _  18  x  /?  x  7n+1 
2(X.X,r  36  x  7n  -  1 

1  L 

Values  for  f  and  f  •  for  aggregates  of  levels  0  through  10  are  given  n  Table  6.  Notice 
that  f2  on  a  level  zero  aggregate  is  38.88  as  we  would  expect  from  our  previous  work 
on  n-gons.  Judging  from  the  entries  in  the  table,  the  values  seem  to  be  decreasing 
with  n  and  converging.  By  checking  the  derivative  of  f  and  taking  the  limit,  the 
reader  can  confirm  that  f  decreases  as  n  increases  and  converges  to  7/3/2  (f2  to 
36.75)  as  n  goes  to  infinity. 


n 

f 

f2 

0 

6.23538 

38.88000 

1 

6.08633 

37.04341 

2 

6.06562 

36.79170 

3 

6.06267 

36.75595 

4 

6.06225 

36.75085 

5 

6.06219 

36.75012 

6 

6.06218 

36.75002 

7 

6.06218 

36.75000 

8 

6.06218 

36.75000 

9 

6.06218 

36.75000 

10 

6.06218 

36.75000 

Table  6.  Values  of  f  and  f 2  for  a  picture  function  which  is 

one  within  a  level  n  GBT  aggregate  and  zero  elsewhere. 


As  a  final  example  we  have  computed  the  value  of  f2  for  all  28  rotationally  distinct 
first  level  GBT  aggregates  and  indicated  them  in  Figure  11.  The  two  aggregates 
containing  a  single  darkened  hexagon  have  f2  =  38.88  as  we  would  expect  from  our 
results  on  n-gons  in  Table  3  and  our  results  on  level  zero  aggregates  in  Table  6.  The 
completely  filled  first  level  aggregate  has  f2  =  37.04  as  expected  from  Table  6.  For 
the  special  case  of  first  level  aggregates,  f2  differentiates  completely  between 
connected  and  disconnected  patterns.  Notice  that  each  aggregate  in  Figure  1 1  is 
connected  or  disconnected  if  its  f2  value  is  greater  or  less  than  15.  An  examination 
of  the  various  shapes  and  f2  values  gives  some  insight  as  to  the  behavior  of  f2  on  more 
general  binary  patterns. 
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13.18 


23.  96 


24.41 


10.10 


#  $  * 

25.99  5.  20  15.19 


Figure  11.  Values  of  for  the  28  Rotationally  Distinct  Classes 

of  Binary  Picture  Functions  on  First  Level  GBT  Aggregates. 
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4.10  Texture 


The  shape  descriptors  that  have  been  presented  in  the  proceeding  sections  are  quite 
powerful  in  identifying  and  separating  objects  from  their  background  in  an  image 
provided  the  picture  function  has  the  value  1  on  the  objects  and  0  on  the  background. 
In  most  applications,  however,  this  is  not  the  case.  In  a  photograph,  for  instance, 
what  our  eye  and  brain  recognize  as  a  single  entity  may  be  made  up  of  pixels  of  many 
different  values.  The  human  mind  classifies  different  regions  as  being  a  part  of  the 
same  entity,  not  because  the  pixels  in  the  regions  all  have  the  same  value,  but 
because  the  regions  have  the  same  overall  appearance.  Let  us  refer  to  the  notion  of 
the  general  appearance  of  a  region  of  a  picture  as  its  texture.  In  this  section  we  wish 
to  present  a  formal  definition  of  the  texture  of  a  discrete,  GBT  based  picture  function 
in  a  region  R.  In  the  chapter  on  implementation  we  will  show  how  this  definition 
relates  to  our  human  notion  of  visual  texture. 

In  what  follows,  let  A(n)  be  an  nth  level  GBT  aggregate  with  immediate 
subordinates  A- ,  j  =  0,  *  *  ",  6.  We  will  use  A(k)  <  A(n)  to  say  that  A(k)  is  a  kth 
level  aggregat4  that  is  contained  in  A(n).  First  let  us  relate  mean  and  variance  to  G 
functions.  Recalling  the  definitions  from  Section  4.2  and  relations  of  Section  4.7, 


y(p,A(n)) 


M(0,c,p,A(n))  _  G(Q,c,p,A(n)) 
M(0,c,p°,A(n))  G(0,c,p°,A(n)) 

—  G(0,0,p,A(n)) 


Var(p,A(n))  =  M(0,c,p2,A(n))/M(0,c,p°,A(n))  -  |i(p,A(n))2 


=  -i  G(0,0,p2,A(n))  -  -A-G(0,0,p,A(n))2 
7  7 

Another  useful  concept,  the  local  variance,  Var*,  of  p  on  A(n)  is  defined  by: 

Var*(p,A(n))  =  j  E  U(p,A.  )2  -  y(p,A(n))2. 
j=0  ' 

The  local  variance  ignores  the  fact  that  there  is  variability  within  the  immediate 
subordinates  Aj  of  Am).  It  would  be  the  true  variance  if  p  was  uniform  on  each  Aj, 
Suppose  v  is  a  GBT  addressable  point  and  we  have  a  picture  faction  p  defined  within 
a  universe  that  consists  of  a  level  m  GBT  aggregate  that  contains  v.  Then  we  define 
the  texture  of  p  at  v  as  follows: 

T(p,v)  =  (Var*(p,A(l)),  Var*(p,A(2)),  *  *  *,  Var*(p,A(m) ) ) 

where  each  A(k)  is  the  unique  kth  level  aggregate  containing  v.  For  a  region  R  .ve 
define  the  texture  of  p  in  R  by: 

T(p,R)  =  -TjL  E  T(p,v) 
n(R'  vCR 
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(10) 


where  n(R)  is  the  number  of  GBT  addressable  points  v  in  R.  Suppose  R  is  a  disjoint 
union  of  L  regions  Rj,  *  '  *,  R^.  Then, 

T(p,R)  =  rrrU  E  Z  T(p,v) 
j=0  v£R. 

L  n(R. ) 

■  sod"  T(p’V 

This  says  that  the  texture  of  a  region  partitioned  into  subregions  can  be  written  as  a 
convex  sum  of  the  subregion  textures  each  weighted  by  the  relative  size  of  the 
subregion. 

Applying  this  result  to  an  nth  level  aggregate,  A(n),  we  get: 

i  ** 

T(p,A(n))  =  i  s  T(p,A. ) 

j=0  J 

If  we  sum  up  the  first  n  components  of  T(p,A(n))  the  result  is  Var(p,A(n)).  To 
demonstrate  this  result  let  us  assume  that  it  is  true  for  all  aggregates  of  level  n-1  and 
below  and  proceed  by  induction: 


2  T(p,A(n))  =  2  *  E  T(p,A  ) 

k=l  *  k=l  j=0  }  K 

=  E  (  E  T(p,A.),  +  Var  (p,A(n)) 
/  ;  0  k=l  1  K 


=  i  2  Var(p,A.)  +  Var  (p,A(n)) 
j=0  1 

=  -i  E  G(0,0,p2,A  )  -  -J-r  E  G(0,0,p,A  )2 
7n  j  =0  J  7^n  j=0  J 

+  y  2  -J-yGCO.O.p.A.)2  -  4r  G(0,0,p,A(n))2 
'  j=0  7^'  ‘  1 


-  Var(p,A(n)) 

The  definition  of  texture  provided  here  makes  the  texture  at  a  GBT  addressable  point 
a  function  of  the  behavior  of  the  picture  function  in  all  of  its  superior  aggregates. 
Since  this  behavior  cannot  be  determined  by  looking  at  the  point  alone,  there  is  some 
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question  as  to  how  one  computes  texture.  The  method  is  suggested  by  formula  (10). 
Suppose,  inductively,  that  we  have  computed  the  first  k  components  of  the  texture 
vector  for  each  kth  level  aggregate  for  k  £  n-1  and  stored  it  at  that  aggregate 
address  together  with  the  mean  for  that  address.  Then  we  can  compute  the  first  n 
components  of  the  texture  for  A(n)  as  follows: 

1  6 

fork  <  n,  T(p,A(n))k  =  =  £  T(p,A.  ), 

k  7  j=0  J  k 

T(p,A(n))n  =  Var*(p,A(n)) 

=  i  £  y(p,A.  )2  -  Jj  (  E  p(p,A.))2 
j=0  J  j=0  > 

Proceeding  by  this  method,  each  nth  level  aggregate  knows  the  first  n  components  of 
its  texture  vector.  Since  the  GBT  data  access  scheme  requires  that  each  superior 
aggregate  of  an  nth  level  aggregate,  A(n),  be  traversed  to  reach  the  data  associated 
with  A(n),  the  texture  components  higher  than  n  can  be  retrieved  during  this 
operation.  In  this  manner  the  texture  of  any  aggregate  at  any  level  can  be  found  in  a 
manner  natural  to  the  GBT  data  management  process. 

The  discussion  of  texture  provided  above  leaves  one  major  question  unanswered.  What 
does  this  mathematical  formulation  have  to  do  with  our  human  notion  of  visual 
texture?  A  preliminary  answer  is  provided  in  the  next  chapter. 
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5.0  IMPLEMENTATION 


In  the  previous  chapter  the  notion  of  visual  texture  was  given  formal  mathematical 
definition.  In  addition,  a  family  of  measures  which  describe  the  behavior  of  picture 
finctions  over  areas  was  developed.  In  the  case  of  texture,  it  was  necessary  to 

compute  the  texture  for  a  variety  of  samples  to  see  how  this  formal  definition 

related  to  intuitive  ideas  about  texture.  Aggregate  descriptors  were  studied  by 
using  them  in  one  image  analysis  application,  raster  to  vector  conversion.  The  work 
that  was  done  attempted  to  find  and  demonstrate  the  utility  of  the  theoretical 
constructs  of  Chapter  4.0  to  problems  of  understanding  imagery.  The  results,  while 

preliminary,  indicate  the  richness  of  this  approach.  They  are  described  in  this 

chapter. 

5. 1  Texture 


In  order  to  examine  texture,  a  set  of  sample  images  was  collected.  These  were 
photocopied  from  photographs  to  insure  uniformity  of  resolution  in  the  image 
media.  The  samples  were  scanned  at  a  100  micron  resolution  using  an  Optronics 
C-4I00  Colorscan  System.  For  each  pixel  v,  a  gray  scale  value,  p(v),  was  assigned 
where  p(v)  is  an  integer  and  0  £  p(v)  £  255.  Figures  12  through  16  are  from 
electrostatic  plots  of  these  scans.  The  sample  textures  are  labeleo  texture  1 
through  texture  5. 

Each  scan  was  of  such  an  area  that  there  were  thirty-two  level  four  aggregates 
completely  filled  with  pattern.  Those  aggregates  on  the  boundary  which  were  not 
completely  filled  were  ignored. 

For  a  given  texture,  each  full  level  four  GBT  aggregate  was  assumed  to  be  a  sample 
of  the  texture.  Its  texture  vector  was  computed. 

Suppose  A  is  such  a  level  four  aggregate.  In  Section  4.10  the  texture  of  A  is  defined 
to  be  (Vj(p,A),  V2(p,A),  V3(p,A),  V^(p,A))  where  Vk(p,A)  is  the  mean  of  the  local 
variances  of  the  level  k  aggregates  subordinate  to  A  and  the  local  variance  of  a  level 
k  aggregate  is  just  the  variance  of  the  mean  of  p  over  each  of  its  seven  level  k-1 
subordinates.  Table  7  gives  the  mean  texture  vector  for  each  sample  together  with 
the  standard  deviation  in  each  component.  From  these  it  can  be  seen  that  the 
vectors  of  each  texture  cluster  together.  To  get  some  indication  of  the  separation 
of  these  clusters,  the  Mahal anobis  distance  was  used.  This  statistic  measures  the 
distance  in  terms  of  statistical  variation  to  the  mean  vector  of  a  multivariate 
normal  distribution.  It  is  defined: 

d(v,y)  =  (v  -  y)*  E'1  (v  -  y) 

where  y  is  the  mean  vector,  I  is  the  covariance  matrix,  and  v  is  a  vector.  Table  8 
shows  the  average,  maximum,  and  minimum  distances  from  the  vectors  of  one 
texture  to  the  mean  of  another.  It  was  found  that  for  samples  1,  2,  and  4,  if  v  is  a 
vector  of  texture  T  and  w  is  a  vector  of  a  different  texture,  v  is  closer  to  the  mean 
of  the  T  distribution  than  w  is.  While  such  a  result  does  not  hold  in  general  (textures 
3  and  5  do  not  have  this  property  with  respect  to  each  other),  it  does  indicate  that 
samples  of  like  texture  form  well-defined  clusters. 
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Mean 


Standard  Deviation 


Texture  1 

(234, 

313, 

191, 

78) 

(65, 

86, 

92, 

40) 

Texture  2 

(270, 

457, 

695, 

551) 

(83, 

95, 

161,  272) 

Texture  3 

(623, 

553, 

533, 

141) 

(343, 

82, 

68, 

56) 

Texture  4 

(476, 

673, 

200, 

24) 

(169, 

64, 

34, 

14) 

Texture  5 

(1258, 

746, 

356, 

48) 

(418, 

97, 

76, 

24) 

Table  7.  Mean  Vectors  and  Standard  Deviations  for  Sample  Textures. 


-  - 

Texture  1 

Texture  2 

Texture  3 

Texture  4 

Texture  5 

Texture  1 

mm 

Average 

1.8 

13.8 

7.9 

33.5 

Maximum 

3.0 

28.1 

23.6 

51.9 

Minimum 

0.7 

8.0 

wBSm 

3.8 

3.3 

Texture  2 

Average 

3.8 

1.9 

6.5 

8.2 

13.5 

Maximum 

10.8 

2.9 

17.0 

11.4 

18.6 

Minimum 

6.3 

1.1 

3.6 

— 

7.3 

6.0 

Texture  3 

Average 

13.8 

12.4 

1.8 

7.6 

4.7 

Maximum 

19.0 

22.1 

3.1 

10.3 

6.5 

Minimum 

6.7 

7.1 

0.8 

6.0 

2.7 

Texture  4 

Average 

8.5 

51.1 

17.9 

1.9 

8.5 

Maximum 

15.4 

96.7 

26.5 

3.3 

13.6 

Minimum 

3.7 

31.6 

12.0 

1.2 

6.2 

Texture  5 

Average 

6.2 

25.3 

6.0 

4.3 

1.9 

Maximum 

7.7 

53.6 

11.9 

5.3 

3.1 

Minimum 

3.2 

11.6 

2.5 

3.1 

1.2 

Table  8.  Mahalanobis  Distance.  A  row  contains  the  average, 
maximum,  and  minimum  Mahalanobis  distance  to  the 
mean  of  a  texture  as  computed  for  all  samples  of  the 
texture  represented  by  the  column. 
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Figure  13.  Sample  Texture  2.  From  a  Photograph  of  Pleated 
Fabric. 
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Figure  14.  Sample  Texture  3,  From  a  Photograph  ot 
Herringbone  Wool  Fabric. 
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Figure  15.  Sample  Texture  4.  From  a  Photograph  of  Nubby 
Wool  Fabric 
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Figure  16.  Sample  Texture  5.  From  a  Photograph  of  Straw. 
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5.2  Shape  Descriptors 

The  work  done  with  the  family  of  aggregate  descriptors  defined  in  Section  4.0  used 
existing  software  for  processing  binary  (0  or  1  gray  scale  valued)  images.  This 
software  takes  the  output  of  a  scan  file  and  populates  a  GBT  file.  I  was  enhanced 
to  compute  and  store  the  aggregate  descriptors  for  all  aggregates  at  levels  one 
through  four.  This  file  can  be  accessed  through  a  graphics  device.  Figures  17 
through  21  are  electrostatic  plots  of  displays.  The  operator  can  also  ask  for  the 
descriptors  of  a  fourth  level  aggregate  which  is  shown  on  the  screen. 

The  descriptors  were  used  in  an  algorithm  designed  to  replace  an  original  raster 
image  with  a  vector  representation.  This  algorithm  can  be  outlined  as  follows: 

1.  Take  a  raster  image  and  populate  a  GBT  file. 

2.  Compute  descriptors  among  them:  weight,  gradient,  the  eigenvalues  X 
and  Xjj  the  angle  of  the  second  moment. 

3.  Classify  each  aggregate  based  on  its  descriptors.  The  classes  are:  line 
segment,  line  edge,  full,  and  other. 

4.  Start  with  a  level  four  aggregate  A.  If  A  is  classed  as  a  line  segment,  fit  a  line 
to  A.  If  A  is  classed  as  a  line  edge  find  other  edges,  agglomerate  the 
statistics,  and  fit  a  line  to  the  rasters.  If  A  is  full,  do  nothing.  If  A  is  "other", 
look  at  its  level  three  subordinates.  Handle  them  according  to  their 
classifications.  The  result  of  this  algorithm  is  a  file  of  line  pieces  that  can  be 
displayed  graphically  to  see  how  well  the  aggregate  descriptors  represent  the 
raster  patterns. 

The  examples  described  here  are  taken  from  a  scan  at  a  50  micron  resolution  of  an 
electrical  circuit  diagram.  Figure  17  shows  a  horizontal  line  in  that  diagram  in 
raster  form  with  the  GBT  aggregates  overlaid.  Table  9  gives  the  values  of  some  of 
the  descriptors  for  the  fourth  level  aggregate  in  the  center  of  Figure  17.  Table  10 
shows  the  corresponding  values  for  aggregate  B  in  Figure  19  and  for  each  of  its 
seven  subordinates. 


Weight: 

338.0 

f2: 

34.9 

V 

203.7 

X2: 

3.0 

Angle  of  Second  Moment: 

180° 

Table  9.  Descriptors  for  Level  Four  Aggregate  A  in  Figure  17. 
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Figure  17.  A  Portion  of  a  Raster  Scanned  Diagram.  The 
Descriptors  for  the  Fourth  Level  Aggregate  A  are 
given  in  Table  9. 
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A  Portion  of  a  Raster  Scanned  Diagram.  The 
Descriptors  for  the  Fourth  Level  Aggregate  B  are 
given  in  Table  10. 
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Figure  21. 


Based  on  Descriptors,  Line  Segments  are  computed 
for  the  Subordinates  of  B. 
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Aggregate  B 


Weight: 


Angle  of  Second  Moment: 


484.0 

1.2 

22.16 

171.4 

.78° 


Subordinates 

_0_ 

_1_ 

2_ 

JL 

4 

_5_ 

_6_ 

Weight: 

21.0 

142.0 

130.0 

70.0 

117.0 

0.0 

4.0 

f2: 

16.0 

36.0 

33.6 

30.9 

32.5 

0.0 

15.6 

V 

12.8 

32.2 

30.2 

16.0 

17.2 

0.0 

1.7 

X  • 

2‘ 

.4 

3.3 

3.1 

1.9 

4.6 

0.0 

.1 

Angie  of  Second  Moment: 

175° 

178° 

179° 

173° 

O 

OO 

0° 

o 

OO 

Table  10.  Descriptors  for  Level  Four  Aggregate  B  in  Figure  19 
and  for  its  Subordinates  shown  in  Figure  20. 


The  line  fitting  algorithm  processes  aggregate  A  in  this  way: 


1. 


2. 


Check  Weight.  A  weight  of  338  corresponds  to  14%  filled,  neither  a  "full" 
aggregate  nor  so  little  as  to  make  the  other  measures  misleading. 


Check  f  .  An  of  34.9  indicates  a  pattern  which  is  connected  and  very 
rectangle-like  (recall  that  f 2  for  a  rectangle  is  36). 


3.  Consider  ^  and  1°  this  case  they  indicte  a  long,  thin  rectangle,  a  line 

segment. 


4.  The  angle  of  the  second  moment  is  180°  (from  the  vertical).  Thus  the  angle  of 
the  major  axis  is  half  180°  or  90°. 


5.  Approximate  this  pattern  with  a  horizontal  line  segment  passing  through  the 
centroid  of  the  pattern.  Figure  7  shows  the  line  segment  that  was  computed. 

6.  Since  the  scan  resolution  is  50  microns  and  X  -  3,  compute  and  store  an 
average  line  width  of  .0012  inch. 


This  technique  allows  the  rasters  in  this  portion  of  the  diagram  to  be  replaced  rather 
quickly  with  a  concise  description  of  the  pattern  which  they  represent.  In  this  way, 
all  of  the  simple  areas  of  the  diagram  can  be  converted  from  raster  form.  The  next 


example  demonstrates  how  more  complex  regions  are  handled  The  algorithm 
processes  aggregate  B  in  this  way. 

1.  Check  Weight.  A  weight  of  484  indicates  20%  of  the  aggregate  is  filled,  a 
significant  portion. 

2  2 

2.  Check  f  .  is  1.2  for  aggregate  B.  The  pattern  in  B  is  divided  into  disjoint 
pieces.  Because  of  this,  B  cannot  be  accurately  represented  with  a  single 
line.  Classify  B  as  "other"  and  process  its  seven  subordinates. 

3.  Check  Weight.  Those  subordinates  with  weight  sufficient  to  be  considered  are 
0,  1,  2,  3,  and  4.  Subordinates  5  and  6  are  less  than  5%  filled. 

2  2 

4.  Check  f  .  Subordinates  1,  2,  and  4  each  have  a  high  f  :  36,  33.6,  and  32.5. 

Subordinate  3  is  not  as  high  but  still  indicates  a  solid,  connected  piece. 

5.  Consider  and  In  subordinates  1,  2,  and  4  they  indicate  a  long,  thin 

shape  from  .0012  to  .0018  inches  wide.  These  subordinates  are  classified  as 

line  segments. 

6.  In  the  case  of  subordinate  3,  because  of  its  i£  and  ^  values,  the  centroid  of 

the  pattern  :s  examined.  It  lies  very  close  to  the  aggregate  boundary.  This 

subordinate  is  classified  as  a  line  edge. 

7.  The  last  step  in  this  process  tries  to  match  subordinate  3  with  "the  other  edge" 
of  the  line.  In  this  case,  0  and  3  are  agglomerated.  The  resulting  statistics 
indicate  a  line  segment  which  is  computed. 

Figure  21  shows  the  final  results  of  this  algorithm  on  aggregate  B. 

Preliminary  work  with  the  descriptors  defined  in  Chapter  4.0,  has  shown  that  these 
descriptors  can  be  used  to  understand  complex  images.  The  representations  obtained 
merely  by  substituting  line  segments  in  aggregates  have  been  much  more  accurate 
than  previous  vectorizations.  This  approach  is  also  fast  since  large,  simple  areas  of 
the  image  are  processed  at  one  time.  More  detailed  computations  are  reserved  for 
those  areas  where  complexity  is  indicated. 
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6.0  FUTURE  RESEARCH  DIRECTIONS 
6.1  Texture 


The  results  on  texture  presented  in  this  report  provide  a  solid  beginning  for  future 
research  and  development.  Texture  has  been  quantified;  important  properties  of 
that  quantification  have  been  demonstrated;  and  the  quantification  has  been  shown 
to  correspond  to  the  intuitive  idea  of  visual  texture.  What  remains  is  to  use  this 
quantification  of  texture  in  computers  to  decompose  complex  images  into  relatively 
simple  pieces  so  that  shapes,  properties,  and  relations  of  entities  in  the  image  can  be 
more  easily  analyzed.  This  will  require  a  major  effort  but  the  potential  rewards  are 
great. 

Table  11  shows  the  status  of  various  questions  regarding  needed  follow  on  work  on 
texture. 


Question 

Answer 

Comment 

Can  texture  be  defined 
for  a  discrete  picture 
function  in  a  region  of 
GBT  addressable  points? 

Yes.  And  the  texture  so 
defined  has  useful  math¬ 
ematical  properties. 

Accomplished  in  the 
current  research. 

Can  a  similar  definition 
be  provided  for  other 
sampling  and  aggrega¬ 
tion  schemes? 

Probably. 

Requires  more  research. 

If  so,  are  the  textures  of 
the  same  picture  func¬ 
tion  under  two  different 
schemes  comparable? 

Unknown. 

Requires  more  research. 

Can  a  similar  definition 
of  texture  be  provided 
for  continuous  picture 
functions  independent  of 
sampling? 

Unknown. 

Requires  more  research. 

Is  texture  related  to  the 
notion  of  fractal  func¬ 
tions? 

Probably. 

1 

! 

This  relates  to  the  pre¬ 
vious  question. 
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Question 

Answer 

Comment 

Does  texture  as  defined 
on  GBT  grids  correspond 
to  our  human  idea  of 
visual  texture? 

Preliminary,  yes. 

Accomplished  in  the 
current  research. 

How  can  textural  simi¬ 
larities  and  dissimilari¬ 
ties  be  measured? 

By  clustering  in  space 
and  probably  other 

methods. 

Spatial  clustering  was 
done  in  this  research. 

More  work  is  required. 

Can  a  computer  be  pro¬ 
grammed  to  automati¬ 
cally  decompose  an 
image  into  regions  of 
similar  texture? 

Probably. 

Requires  more  research. 

If  so,  will  such  a  decom¬ 
position  be  similar  to  a 
decomposition  provided 
by  a  person? 

Probably. 

Requires  much  research 
and  experimentation. 

If  it  is  achieved,  would 
such  a  decomposition  be 
suited  for  further  pro¬ 
cessing  to  derive  shapes, 
features,  and  relations 
of  entities  in  the  image? 

Yes. 

The  decomposition  pro¬ 
cess  would  be  directed 
at  that  goal.  However, 
much  research  and  ex¬ 
perimentation  is  re¬ 
quired.  | 

1 

Table  11.  Questions  and  Answers  about  Research  on  Texture. 


The  main  need  is  to  proceed  forward  from  the  current  base  to  achieve  techniques  for 
automated  image  decomposition  into  regions  of  similar  texture.  Other  issues 
relating  to  quantifying  texture  in  a  more  general  setting  also  need  to  be  examined. 


6.2  Shape 


During  this  research  we  developed  and  showed  the  properties  of  a  powerful  shape 
descriptor  called  f^.  We  further  showed  how  f^  and  other  shape  descriptors  could  be 
used  in  conjunction  with  the  GBT  cellular  hierarchy  to  analyze  shape  over  regions  of 
varying  size.  The  current  situation  is  that  we  have  developed  powerful  tools  but 
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have  not  yet  learned  how  to  use  them  well.  The  most  important  area  of  future 
research  is  to  learn  how  to  use  the  current  tools  efficiently  to  solve  a  variety  of 
image  understanding  problems.  Other  significant  areas  are  the  deriving  of  new 
shape  descriptors  and  the  development  of  a  knowledge  data  base  against  which 
descriptions  of  entities  in  images  can  be  compared. 

Table  12  shows  the  status  of  various  questions  regarding  needed  follow  on  work  on 
shape  descriptors. 


Question 

Answer 

Comment 

Are  there  functions 
which  are  useful  in  de¬ 
scribing  the  shape  of  a 
picture  function  within  a 
region? 

Yes.  Many  are  classical 
and  one  was  developed 
in  this  research. 

2 

The  f  function  and  its 
properties  are  described 
in  this  report. 

Could  there  be  other 
useful  functions  for  de¬ 
scribing  shape? 

Almost  certainly. 

Only  second  moments 
wpre  used  to  develop 
f  .  Use  of  higher  mo¬ 
ments  should  yield  other 
functions. 

Do  we  know  how  to  use 
the  current  functions  to 
best  advantage? 

No.  We  have  only  pre¬ 
liminary  results. 

Much  research  is  re¬ 
quired. 

Do  the  current  functions 
have  any  utility  inde¬ 
pendently  from  the  G^T 
system? 

Yes.  In  the  current  re¬ 
search  only  the  texture 
vector  is  GBT  depen¬ 
dent. 

GBT  aggregates  are  a 
natural  vehicle  for  com¬ 
puting  picture  function 
descriptors. 

Are  shape  descriptors  all 
that  is  needed  to  under¬ 
stand  entities  within 
images? 

No.  A  knowledge  data 
base  is  necessary  for  un¬ 
derstanding. 

The  current  work  pro¬ 
vides  only  for  the  under¬ 
standing  of  simple  geo¬ 
metric  shapes.  Much 

further  research  is  re¬ 
quired. 

Can  the  shape  descrip¬ 
tors  in  GBT  cells  be  ex¬ 
tended  to  arbitrary  re¬ 
gions? 

Almost  certainly.  A 

successful  first  effort 
was  made  during  this  re¬ 
search. 

More  research  in  this 
area  is  needed. 

Table  12.  Questions  and  Answers  about  Research  on  Shape  Descriptors. 
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8.0  APPENDIX.  GENERALIZED  BALANCED  TERNARY 


WHAT  IS  GBT? 


GBT  is  a  method  of  representing  a  two  dimensional  surface  that  enables  a  computer  to 
work  easily  with  data  distributed  on  that  surface.  When  a  person  looks  at  data  in  two 
dimensions,  a  map  sheet  or  a  photographic  image,  for  example,  he  quickly  sees  the 
general  content  of  that  map  or  photograph,  and  can  examine  certain  aspects  of  the 
content  in  detail  if  he  chooses  to  focus  his  attention.  Computers  which  use  conventional 
representations  for  planar  data  have  difficulty  performing  this  simple  chore.  Such 
systems  permit  easy  examination  of  the  smallest  components  of  the  data,  be  thev  pixels, 
bits,  bytes,  or  vectors,  but  have  no  efficient  mechanisms  for  examining  the  data  in 
aggregate  form.  As  the  saying  goes,  they  cannot  see  the  forest  for  the  trees. 

The  GBT  method  of  structuring  a  surface  permits  data  to  be  aggregated  so  that  an 
automated  system  can  examine  the  general  content  of  the  data  without  looking  at  the 
detail.  Because  of  th>s  capability,  an  algorithm  can  determine  at  a  high  level  if  the 
generalized  information  suffices  for  the  algorithm's  purpose  or  whether  finer  information 
is  needed.  In  the  latter  case,  the  GBT  structure  permits  the  layered  accessing  of  finer 
and  finer  data  until  the  finest  granularity  is  reached.  This  capability  for  selective  access 
to  successive  levels  of  detail  is  taken  for  granted  in  human  perception,  but  it  has 
presented  a  significant  problem  in  designing  systems  for  machine  perception.  It  is 
fundamental,  for  example,  to  the  notions  of  scene  analysis,  feature  extraction,  and 
pattern  recognition.  The  GBT  system  solves  this  problem. 

GBT  is  not  only  a  method  for  representing  space,  it  is  also  an  addressing  scheme  that 
allows  access  to  that  representation.  Further,  it  contains  an  algebraic  svstem  which 
operates  on  the  addressing  scheme.  These  aspects  of  GBT  will  be  described  below.  A 
GBT  structure  can  be  implemented  in  any  dimension,  however,  a  discusson  of  dimensions 
other  than  the  second  is  beyond  the  scope  of  this  paper.  GBT  stands  for  Generalized 
Balanced  Ternary,  a  term  signifying  that  GBT  is  the  higher  dimensional  analogue  of  the 
one  dimensional  system  known  as  balanced  ternary. 

THE  STRUCTURE  AND  ADDRESSES 


The  GBT  structure  is  one  of  a  hierarchy  of  cells.  At  each  level,  the  cells  are  constructed 
of  cells  from  the  previous  level  according  to  a  rule  of  aggregation.  The  basic  cells  of 
this  structure  are  hexagous.  Figure  1  shows  the  hexagonal  covering  of  the  plane.  This 
covering  has  the  uniform  adjacency  property,  that  is,  each  element  of  the  covering  is 
adjacent  to  exactly  six  other  elements  and  shares  with  each  exactly  one-sixth  of  its 
boundary.  In  contrast,  a  covering  of  the  plane  with  squares  does  not  have  uniform 
adjacencies.  Some  squares  are  adjacent  at  a  point  while  others  share  a  side. 

A  first  level  aggregate  is  formed  by  taking  a  single  hexagon  and  its  six  neighbors  (see 
Figure  2a).  The  first  level  aggregates  also  cover  the  plane  and  have  the  uniform 
adjacency  property.  In  general,  an  aggregate  at  level  n  is  formed  by  taking  a  level  n-1 
aggregate  and  its  six  neighbors.  It  can  be  shown  that  the  planar  covering  and  uniform 
adjacency  properties  hold  at  each  level.  Figures  2b  and  2c  show  second  and  third  level 
aggregates. 
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The  GBT  addressing  system  is  based  on  the  following  scheme.  In  an  aggregate,  the 
center  cell  is  labeled  0  and  the  outer  six  cells  are  labeled,  in  clockwise  order,  1,  3,  2,  6, 
4,  5  (see  Figure  2).  Each  hexagon  in  the  plane  has  a  unique  GBT  address,  a  sequence  of 
digits  corresponding  to  the  labels  of  the  cells  above  that  hexagon. 

Each  digit  of  the  address  corresponds  to  an  aggregate  level.  For  example,  the  address 
536  labels  the  hexagon  in  the  6  position  of  the  first  level  aggregate,  which  is  in  the  3 
position  of  the  second  level  aggregate,  which  is  in  the  5  position  of  the  third  level 
aggregate,  which  is  at  the  0  or  center  position  at  all  higher  levels.  The  hexagon  536  is 
shaded  in  Figure  3. 

The  digit  7  is  used  to  address  entire  aggregates  rather  than  hexagons.  Thus,  the  first 
level  aggregate  shaded  in  Figure  4  has  address  117.  The  second  level  aggregate  outlined 
in  the  same  figure  has  address  677. 

The  symbols  0,  1,  2,  3,  4,  5,  6,  and  7,  used  as  GBT  digits,  are  also  used  in  octal  and 
decimal  notation  to  express  integers.  Using  them  in  this  new  context  allows  GBT 
addresses  to  be  handled  directly  by  computer  hardware  and  software.  Care  must  be 
taken,  however,  not  to  confuse  the  addresses  with  integers. 
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FIGURE  1:  The  Hexagonal  Covering  of  the  Plane 
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Figure  2B:  Second  Level  Aggregate 


FIGURE  2:  The  Aggregate  Structure 


FIGURE  3:  The  Location  of  the  Hexagon 


THE  ARITHMETIC 


In  order  for  the  GBT  addressing  system  to  be  useful,  there  must  be  efficient  methods  for 
doing  planar  addition,  subtraction,  and  multiplication  entirely  in  terms  of  GBT 
addresses.  These  methods  are  discussed  below: 

1.  Addition:  Addition  of  GBT  addresses  parallels  integer  addition  in  that  if  am  addition 
table  for  the  seven  digits  is  given,  any  two  addresses  can  be  added.  From  Figure  5, 
we  can  derive  the  basic  GBT  addition  table.  Using  standard  planar  parallellogram 
addition,  we  see,  for  example,  that  1+2=3,  3+6=2,  and  5+6=4.  If  3  and  2  are  added, 
the  sum  is  outside  the  central  first  level  aggregate,  3+2=25.  Thus,  Table  1  is 
obtained.  Addition  of  multidigit  addresses  is  very  much  like  adding  multidigit 
integers.  For  example  (see  Figure  6),  to  add  153  and  45,  add  3+5  =  1,  then  5+4=52,  and 
carrying  the  5  to  the  next  column,  5+1  =  16.  Thus,  153+45=1621.  Figure  6B  shows 
these  vectors  in  the  plane. 

2.  Subtraction:  Subtraction  is  accomplished  by  the  process  of  complementing  and 
adding.  The  complement  of  a  GBT  address  is  its  digitwise  sevens  complement.  The 
complement  of  61542  is  16235,  for  example.  In  Figure  5,  we  see  that  3  and  4  are 
complements,  as  are  1  and  6,  and  53  and  24. 

3.  Multiplication:  GBT  multiplication  is  similar  to  addition  in  that  it  is  a  digitwise 
operation.  The  multiplication  table  (Table  2)  shows  that  the  GBT  product  of  two 
digits  is  just  their  integer  product  modulo  7.  An  example  of  multidigit 
multiplication  is: 


254  x  62: 

254 
x  62 

431  (=  2  x  254) 

523  (=  6  x  254) 

5261  (=  the  GBT  sum) 

Figure  7  illustrates  this  product  geometrically. 
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TABLE  1:  GBT  Addition 


Figure  6 A 
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THE  GBT  DATA  BASE  STRUCTURE 


Any  set  of  GBT  addresses  generates  a  tree  structure.  The  tree  derived  from  a  list  of  8 
four  digit  GBT  addresses  is  shown  in  Figure  8.  This  tree  originates  from  a  root  node 
which  corresponds  to  the  universal  cell  containing  all  addressable  cells.  Attached  to  the 
root  node  is  a  node  corresponding  to  the  first  of  a  set  consisting  of  those  next  lower  level 
cells  beneath  the  universal  cell  which  contain  any  of  the  cells  on  the  list.  Generally, 
each  node  in  the  tree  corresponds  to  a  cell  in  the  GBT  cellular  hierarchy.  Every  node 
within  a  particular  level  of  the  tree  structure  is  linked  to  its  siblings  in  sorted  order  by 
the  digit  of  the  GBT  address  corresponding  to  the  level.  The  first  node  within  the  list  is 
attached  to  the  parent  cell  for  that  level.  Any  given  node  can  have,  at  most,  seven 
subordinates,  but  may  have  fewer. 

This  tree  structure  provides  an  easy  way  of  accessing  data  by  general  location.  Figure  9 
shows  the  location  of  the  cells  whose  addresses  are  listed  in  Figure  8.  Suppose  someone 
were  to  ask  for  a  list  of  the  addresses  stored  in  the  tree  which  are  within  four  units  (1 
unit  is  the  distance  between  hexagon  centers)  of  4221.  A  simple  computation  shows  that 
any  such  address  must  be  a  subordinate  of  one  of  the  cells  4277,  6577,  6477,  and  43177. 
The  latter  cell  is  not  part  of  the  four  digit  universe  used  in  this  example,  and  therefore, 
is  not  checked.  To  check  the  contents  of  4277,  we  start  at  the  tree  root,  follow  a  pointer 
to  cell  4777,  then  to  cell  4277.  Then  we  follow  all  pointers  from  the  4277  node  to 
discover  the  addresses  4201,  4207,  4245,  and  4267.  These  are  placed  on  a  list  of 
candidate  addresses.  To  check  6477  and  6577,  we  first  descend  from  the  root  node  to 
cell  4777  then  to  the  node  corresponding  to  6777.  At  the  cell  6777  information  is  stored 
within  the  cell  describing  the  subordinates  to  the  cell.  It  is  at  this  point  we  discover  that 
there  is  no  pointer  to  6477,  so  there  can  be  no  stored  addresses  subordinate  to  that  cell. 
We  then  follow  the  pointer  from  cell  6777  to  cell  6077  and  from  there  to  cell  6017  and 
finally  to  cell  6577.  By  following  all  subsequent  pointers  from  cell  6577  we  locate  cells 
6562  and  6566  which  are  added  to  the  list  of  candidate  addresses.  The  next  step  is  to  use 
the  GBT  subtraction  operation  and  vector  length  measurement  to  eliminate  those 
addresses  on  the  list  which  are  more  than  four  units  from  4221.  This  step  eliminates 
4245  and  reduces  the  list  to  4201,  4207,  4267,  6562,  and  6566,  all  of  which  are  (at  least 
partially)  within  4  units  of  4221. 
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General  GBT  data  accesses  operate  in  the  same  manner  as  the  above  example.  If  a 
region  R  is  to  be  searched,  the  addresses  of  a  small  number  of  large  cells  which  cover  R 
are  determined.  Then  these  cells  are  searched  by  examining  their  subordinate  nodes  in 
the  data  structure  until  all  stored  subordinates  of  these  cells  are  accessed.  Then,  this 
list  is  checked  against  the  boundary  of  region  R  to  determine  which  cells  actually 
intersect  that  region.  This  is  a  relatively  direct  procedure  for  accessing  by  location  in 
that  it  avoids  any  consideration  of  addresses  which  lie  outside  the  set  of  cells  which 
cover  R.  This  locational  access  technique  is  not  of  great  utility  if  the  number  of 
addresses  stored  in  the  data  structure  is  small.  However,  in  large  data  bases,  maps  or 
imagery,  for  instance,  which  contain  multiple  millions  of  points,  some  such  access 
mechanism  is  imperative  to  rapid  operation. 

The  tree  structure  shown  in  Figure  8,  and  described  above,  is  wasteful  in  that  some  nodes 
exist  in  the  tree  solely  to  provide  a  linkage  between  a  larger  cell  and  a  very  small 
number  (1  or  2)  of  subordinates.  These  problems  can  be  overcome  by  the  structure 
indicated  in  Figure  10.  This  abbreviated  tree  structure  does  not  contain  separate  nodes 
for  many  of  the  parent  cells  of  addresses  which  are  stored.  Instead,  the  suffix  digits  of  a 
stored  address,  or  stored  addresses,  are  listed  in  a  node  corresponding  to  an  address 
which  may  be  several  levels  above  the  stored  addresses  in  the  hierarchy.  A  new  level  in 
the  abbreviated  tree  structure  is  created  only  when  the  number  of  cells  below  the  parent 
cells  becomes  large  enough  that  it  is  no  longer  efficient  to  use  list  processing  methods 
for  the  cells.  The  effect  of  the  abbreviated  tree  structure  is  that  stored  addresses  mav 
be  accessed  by  passing  through  fewer  nodes  than  in  the  full  tree  structure  of  Figure  8. 
The  locational  access  properties  of  the  full  tree  are  preserved  in  the  abbreviated  version. 
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FIGURE  10 


An  Abbreviated  Tree  Corresponding  to  the  same  Address  List  as  Figure  8 


The  address  terminator  lists  shown  in  boxes  are  pointed  to 
by  the  node  just  above  them. 
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